
PREFACE 


After the appearance in 1952 of my “Introduction to Metamathematics”, 
written for students at the first-year graduate level, I had no expectation of 
writing another text. But various occasions arose which required me to 
think about how to present parts of the same material more briefly, to a 
more general audience, or to students at an earlier educational level.* 
These newer expositions were received well enough that I was persuaded 
to prepare the present book for undergraduate students in the Junior year. 

In “Introduction to Metamathematics”, the study of mathematical logic 
begins properly only in Chapter V (with some definitions in Chapter IV). 
Graduate students in mathematics can cover rapidly the introductory 
material which precedes it there. But for less advanced students or in 
shorter courses, too much time would be used by such a thorough intro- 
duction. I am now convinced that it is also sound pedagogically and 
scientifically to start doing logic (correctly) right from the beginning, even 
if not all the reasons for doing it nor all the criteria governing how it is 
done have been enumerated in advance. The rest of the “introduction” 
can come later. 

On this basis. Part I (Chapters I— III) of the present book gives quite a 
thorough, yet elementary, treatment of mathematical logic of first order 
(substantially equivalent to Chapters V-VII and § 73 of IM). The treat- 
ment does not stop at formulating logic in one way and practicing with 
that, as one might do at an even more elementary level. Modern logicians 
characteristically work with their material in a flexible manner, using 
different formulations, and passing from one to another, as suits the 
purpose at hand. Thus in Part I the student will first meet model theory 
(truth tables) in a fuller treatment than in IM, then Hilbert-type proof 
theory (with postulates, including modus ponens), and thirdly proof theory 
handled through derived rules. The principal derived rules are essentially 
the ones, akin to those in Gentzen’s natural deduction systems, which I 
have been using in teaching logic since 1936. (In Chapter VI, a Gentzen- 
type sequent system will be introduced, as a fourth formulation of logic.) 

Part II of the book is intended to supplement Part I, by providing 
greater depth of understanding of Part I and an introduction to some of 

* Cf. the last five items under my name in the bibliography (pp. 378-379). 
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the newer ideas and more profound results of logical research in the present 
century. In Part 11 the treatment is less elementary than in Part I. Ac- 
cording to the time available and the level of the^lass, the material in 
Part II may be surveyed or studied more intensively. I have never believed 
that even the average student is benefitted by avoiding entirely material 
that only the better students can be counted on to appreciate fully. How- 
ever, trial of the material in classes has shown that, omitting the starred 
sections, a class can in a semester get part way into Chapter VI. 

Specifically, Chapter IV is the postponed “introduction” (abridged from 
IM Chapters I— HI) plus an introduction and prospectus to the study of 
formal number theory (Chapters IV, VIII of 1M). Chapter V surveys the 
famous incompleteness and undecidability results of Godel, Church and 
others, using the Turing machine concept, without always giving detailed 
proofs. (Thus an overview is provided of some of the principal results of 
IM § 42 and Part III, without the detailed theory developed there.) These 
two chapters concern the foundations of mathematics more than pure logic. 

In Chapter VI the emphasis reverts to logic. Godel’s completeness 
theorem and Gentzen’s theorem (besides theorems of Lowenheim, Skolem, 
Herbrand, Henkin, Beth, Craig and A. Robinson) are obtained, using an 
approach which has been in the literature only since 1955. There are more 
compact treatments of Godel’s completeness theorem. The one used here 
is thought to have the merit, for an introduction to the subject, that almost 
from the very beginning it should both be clear in what direction one is 
going and plausible that with patience in managing details one can thus 
reach the goal. Besides, this approach gives Gentzen’s theorem quickly, 
though nonconstructively. (This chapter corresponds to Part IV of IM, 
but there are considerable differences in the approach and selection of 
topics.) 

Chapters IV and V can be omitted in order to facilitate completing 
Chapter VI in a semester course emphasizing logic without the foundations 
of mathematics. (A few points from Chapters IV and V can then be picked 
up as needed in Chapter VI, and only a few items in Chapter VI will have 
to be omitted.) 

A fair number of exercises is supplied; but especially in Part II they do 
not illustrate all the topics. A course taught from this book is not intended 
to be primarily a problem course. The student should be disabused of the 
idea Freshman calculus students often have that the text is of no impor- 
tance except as it helps him to do the exercises. Mastering definitions is 
especially important to a proper understanding of the enterprise. 

I am grateful to H. William Oliver and Edward Pols for taking the 
notes on my lectures at NSF Summer Institutes at Williams in 1956 and 
Bowdoin in 1961, respectively. The 1956 lectures and notes were reworked 
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in 1961, and the latter have been reworked and very much expanded in 
this book. Among the added subjects are ones suggested by H. Jerome 
Keisler, Georg Kreisel and Julius R. Weinberg. Keisler also suggested 
improvements, and supplied some eight exercises, after teaching from a 
draft of the book. Weinberg, William W. Boone, Burton Dreben and 
Jean van Heijenoort helped with some references. Particularly, the last 
two persons are my source for a more accurate assessment of the contribu- 
tions of Lowenheim, Skolem and Herbrand than has hitherto prevailed 
in the literature. Finally, I thank William E. Ritter for reading the printer’s 
proof independently of myself, and for suggesting improvements made in 
proof. 

5. C. Kleene 

Madison, Wisconsin 
October, 1966 
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CHAPTER I 


THE PROPOSITIONAL CALCULUS 


§ 1. Linguistic considerations : formulas. Mathematical logic (also 
called symbolic logic) is logic treated by mathematical methods. But our 
title has a double meaning, since we shall be studying the logic that is used 
in mathematics. 

Logic has the important function of saying what follows from what. 
Every development of mathematics makes use of logic. A familiar example 
is the presentation of geometry in Euclid’s “Elements” (c. 330-320 B.C.), 
in which theorems are deduced by logic from axioms (or postulates). But 
any orderly arrangement of the content of mathematics would exhibit 
logical connections. Similarly, logic is used in organizing scientific knowl- 
edge, and as a tool of reasoning and argumentation in daily life. 

Now we are proposing to study logic, and indeed by mathematical 
methods. Here we are confronted by a bit of a paradox. For, how can we 
treat logic mathematically (or in any systematic way) without using logic 
in the treatment? 

The solution of this paradox is simple, though it will take some time 
before we can appreciate fully how it works. We simply put the logic that 
we are studying into one compartment, and the logic that we are using to 
study it in another. Instead of “compartments”, we can speak of “lan- 
guages”. When we are studying logic, the logic we are studying will pertain 
to one language, which we call the object language , because this language 
(including its logic) is an object of our study. Our study of this language 
and its logic, including our use of logic in carrying out the study, we regard 
as taking place in another language, which we call the observer's language. 1 
Or we may speak of the object logic and the observer's logic. 

It will be very important as we proceed to keep in mind this distinction 
between the logic we are studying (the object logic) and our use of logic in 
studying it (the observer’s logic). To any student who is not ready to do so, 

1 In the literature, this language is usually called the “metalanguage” or the “syntax 
language”. However, both these names often carry a connotation about the scope of 
the study or the type of the methods used in it. Cf. pp. 62-65 (especially bottom p. 63) 
of our “Introduction to Metamathematics” 1952b (hereafter cited as “IM”). 10 To avoid 
such a connotation when it is not intended, we are adopting “observer’s language”. 

In a textbook on Russian written in English, Russian is the object language and 
English is the observer’s language. 
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we suggest that he close the book now, and pick some other subject in- 
stead, such as acrostics or beekeeping. 

All of logic, like all of physics or all of history, constitutes a very rich 
and varied discipline. We follow the usual strategy for approaching such 
disciplines, by picking a small and manageable portion to treat first, after 
which we can extend our treatment to include some more. 

The portion of logic we study first deals with connections between 
propositions which depend only on how some propositions are constructed 
out of other propositions that are employed intact, as building blocks, in 
the construction. This part of logic is called propositional logic or the 
propositional calculus. 

We deal with propositions through declarative sentences which express 
them in some language (the object language); the propositions are the 
meanings of the sentences. 2 Declarative sentences express propositions 
(while interrogatory sentences ask questions and imperative sentences 
express commands). The same proposition may be expressed by different 
(declarative) sentences. Thus “John loves Jane” and “Jane is loved by 
John” express the same proposition, but “John loves Mary” expresses a 
different proposition. Under the usual definition of > from <, the two 
sentences “5 < 3” and “3 > 5” express the same proposition (which 
happens to be false), namely that increasing 5 by a suitable positive 
quantity will give 3; but “5 2 — 4 2 = 10” expresses a different proposition 
(also false). Each of “5 < 3”, “3 > 5” and “5 2 - 4 2 = 10” asserts some- 
thing about the outcome of a mathematical process, which is the same 
process in the first two cases, but a different one in the third. “3 — 2 = 1” 
and “(481 — 581) + 101 = 1” express two different propositions (both 
true). 

We save time, and retain flexibility for the applications, by not now 
describing any particular object language. (Examples will be given 
later.) 

Throughout this chapter, we shall simply assume that we are dealing 
with one or another object language in which there is a class of (declarative) 
sentences, consisting of certain sentences (the aforementioned building 
blocks) and all the further sentences that can be built from them by certain 
operations, as we describe next. These sentences we call formulas, in 
deference to the use of mathematical symbolism in them or at least in our 
names for them. 

First, in this language there are to be some unambiguously constituted 
sentences, whose internal structure we shall ignore (for our study of the 
propositional calculus) except for the purpose of identifying the sentences. 

2 Hence some writers call this part of logic “sentential logic” or the “sentential 
calculus”. 
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We call these sentences prime formulas or atoms; and we denote them by 
capital Roman letters from late in the alphabet, as “P”, “Q”, “R”, . . . , 
“Pi”, “P 2 ”, “P 3 ”, . - .. Distinct such letters shall represent distinct atoms, 
each of which is to retain its identity throughout any particular investiga- 
tion in the propositional calculus. 

Second, the language is to provide five particular constructions or 
operations for building new sentences from given sentences. Starting with 
the prime formulas or atoms, we can use these operations, over and over 
again, to build other sentences, called composite formulas or molecules, as 
follows. (The prime formulas and the composite formulas together con- 
stitute the formulas.) 3 * If each of A and B is a given formula (i.e. either a 
prime formula, or a composite formula already constructed), then A ~ B, 
A 3 B, A & B and A V B are ( composite ) formulas. If A is a given formula , 
then "iA is a ( composite ) formula. (The first four operations are “binary” 
operations, the last is “unary”.) 

.The symbols “3”, “V” and “~i” are called propositional 

connectives .* They can be read by using the words shown at the right in 
the following table; but the symbols are easier to write and manipulate. 5 6 

Equivalence ~ “(is) equivalent (to)”, “if and only if” 
Implication 3 “implies”, “if ... then ...”, “only if " 

Conjunction & “and” 

Disjunction V “or”, “. . . or . . . or both”, “and/or” 

Negation -i “not” 

Here we must mention the fact that the natural word languages, such as 
English, suffer from ambiguities. (Of this, more will be said later.) Logi- 
cians are therefore prone to build special symbolic languages. Our 

3 The analogy with chemistry limps a little, since we use “molecule” only for a formula 
which is not an atom, whereas in chemistry an atom may sometimes be a molecule 
(e.g. helium He). 

1 Other symbols are often used in the literature, the most common being “=”, 
or “Jl” for our for “3”; (sometimes omitted) or “A” for 

or ” (thus: A) for “-i”. The symbols “V” and “— |” need not be made as large 
as in the type used in this book. 

6 Anyone who doubts the advantages of symbols (in their proper place) is invited to 
solve the equation x* + 3x — 2 = 0 by completing the square (as taught in high school), 
but doing all the work in words. We start him off by stating the equation in words: 
The square of the unknown, increased by three times the unknown, and diminished by 
two, is equal to zero. 

Anyone who doubts that apt choices of mathematical symbolism have played a major 
role in the modern development of mathematics and science is invited to multiply 416 
by 144, but doing all the manipulations in Roman numerals. His problem is thus to 
multiply CDXVI by CXLIV. Cf. Time magazine, vol. 67 no. 12 (March 19, 1956), 
p. 83. 
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unspecified object language may be such a symbolic language, having 
symbols “3”, “V” and i” which play roles described 

accurately below but suggested or approximately described by the words. 
It comes to nearly the same thing to think of the object language as a 
suitably restricted and regulated part of a natural language, such as Eng- 
lish; then “3”, “V” and i” can be thQ,ught of as names in 

the observer’s language for the verbal expressions at the right in the table. 6 

The names at the left in the table above apply to the propositional 
connectives or to the formulas constructed using them. Thus is our 
symbol for conjunction; and A & B is a conjunction, namely the conjunc- 
tion of A and B. Also A 3 B is the implication by A of B; etc. 

So that there will be no ambiguity as to which formulas are the atoms, 
we now stipulate that none of the atoms be of any of the five forms A ~ B, 
A 3 B, A & B, A V B and ~\A which the molecules have. 7 * * For example, 
“Socrates is a man”, “John loves Jane”, “John loves Mary”, “5 < 3”, 
“3 > 5”, “a+b = c” and “ a > 0” (where “a”, “6”, “c” stand for 
numbers) could be atoms; then “John loves Jane or John loves Mary”, 
“~i 5 < 3” and “5 < 3 ~ 3 > 5” would be molecules. 

We are using capital Roman letters from the beginning of the alphabet, 
as “A”, “B”, “C”, . . . , “A!”, “A 2 ”, ”A 3 ”, . . . , to stand for any formulas, 
not necessarily prime. Distinct such letters “A”, “B”, “C”, . . . , “A/’, 

6 If ‘3”, etc. are symbols in the object language, then, when we write “A ~ B”, 

“A 3 B”, etc., we have a mixture of the two languages, since “A”, “B” are names in 
the observer’s language for formulas in the object language, while “3”, etc. 

are symbols of the object language itself. However, it should be clear enough here what 
is meant: “A 3 B” is a name in the observer’s language for the formula in the object 
language which results by infixing the symbol “3” of the object language between the 
two formulas in the object language which arc named in the observer’s language by 
“A” and "B” respectively. The mixing of languages disappears if we agree that “3” 
can serve as a name for itself in these contexts, and generally whenever we need a name 
in the observer’s language for the symbol “3” of the object language. We say then that 
“3” is being used autonymously (after Carnap 1 934). 10 We confine this usage to 

“3 ”, etc. and other symbols of a symbolic or partially symbolic object language when 
it should be clear that we are talking about expressions in that language. 

Ordinary English words will not be used autonymously. Hence, when they are used 
outside of quotation marks, they must be understood as in the observer’s language. 

If we should wish to name the sentence A 3 B but using the words “if . . . then . . .” 
instead of the symbol 3 (here “3” is used autonymously), we would write “ “if A 
then B” ”. (The name of the sentence is what is inside the outer quotes; the whole is 
the name of that name.) Here again there is a mixture of languages (inside the inner 
quotes), but the meaning should be clear: the sentence named is the one obtained by 
replacing the letters “A” and “B” by the sentences they name. 

7 1 f any of the sentences we originally proposed to take as prime were already of those 

forms, we could start over by dissecting them into components not of those forms and 

using those components instead in our list of atoms. 
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1< A ” “A ** 

^2 > ^*-3 > • • • 

<P ”, «Q^ « R » 


need not represent distinct formulas (in contrast to 
“Pi”, “Pj”, “P s ”, . . . , which represent distinct prime 


formulas). 

Composite formulas would sometimes be ambiguous as to the manner 
in which the symbols are to be associated if we did not introduce paren- 
theses. So we shall write “(A 3B)D C” or “A 3(BD C)” and not 
simply “A 3 B 3 C”. However we can minimize the need for parentheses 
by assigning decreasing ranks to our propositional connectives, in the 
order listed : 8 


3, &, V, “I. 


Where there would otherwise be two ways of construing a formula, the 
connective with the greater rank reaches further. Thus “A 3 B & C” shall 
mean A 3 (B & C), and “C~A&B3 C” shall mean C ~ ((A & B) 3 C). 
The unary operator n being ranked last, “~iA V B” shall mean ( — iA) V B 
rather than — j(A V B), and “”i~iA 3 A” shall mean (-i(”iA)) 3 A. This 
practice is familiar in algebra, where “a + be* — </”means(a + (b(c 2 ))) — d. 

Example 1. In “A3(B 3C)” the letters “A”, “B”, “C” stand for 
formulas constructed from P, Q, R, . . . , Pi* Ps» • • • (*- e - from the atoms 
named by “P”, “Q”, “R”, .... “Pi”, “P„”, “P 3 ”) using (zero or more 
times) ~, 3, &, V, ~i, and (as required) parentheses. For example, 
A 3 (B 3 C) might be the particular formula P 3 (Q V R :? (R 3 -iP)) 
(so A is P, B is Q V R and C is R 3 -iP). Here a second pair of paren- 
theses has been inserted, but our ranking of the symbols makes parentheses 
around Q V R superfluous. The parentheses enable us to see how the for- 
mula was constructed, starting from the atoms P, Q, R, and using five 
steps of composition to introduce the five numbered occurrences of 
propositional connectives, thus : 

P 

Q R R “)|P 

Q Vj R R 3 3 — i 2 'P 

P QV,R3 4 (R3 3 -i 2 P) 

P3 5 (QV 1 R3 4 (R3 a -> a P)) 


— l —l —a 

—3 3 

* 4 

—5 5 


8 Some authors instead rank V ahead of & ; this is done in Algol and some other 
computer-programming languages. We shall rarely use our ranking between & and V 
(which follows Hilbert and Bernays 1934 and IM).* Some authors (as Whitehead and 
Russell 1910-13) replace parentheses to a greater or lesser extent by dots 
used in the manner of punctuation marks in English. 

When we say (next) that, via the ranking, “A 3 B & C” shall mean A 3 (B & C), 
we mean that “A 3 B & C” becomes a name for the formula A 3 (B & C) which 
“A 3(B & C)” already named. 
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In a manner obvious from the parentheses or the construction, each 
occurrence of a connective “connects” or “applies to” or "operates on” 
one or two parts of the formula, called the scope of that (occurrence 
of a) connective. The scopes of the connectives are shown here by corre- 
spondingly numbered underlines; thus the scope of ID* consists of the two 
parts Q V R and R 3 -iP. 

Exercise 1.1. Identify the scope of each (occurrence of a) propositional 
connective : (a) P 23 — iP ~ — iP. (b) ~iP & Q ~ R & — j — i(P V Q) "D S. 

§ 2. Model theory: truth tables, validity. Not only are we restrict- 
ing ourselves in this chapter to the study of the logic of propositions. But 
also in this and later chapters we shall concern ourselves primarily with a 
certain kind of logic, called classical logic. 

Since the discovery of non-EucIidean geometries by Lobatchevsky 
(1829) and Bolyai (1833), it has been clear that different systems of geom- 
etry are conceptually equally possible. (We shall say a little more about 
this in § 36.) Similarly, there are different systems of logic. Different 
theories can be deduced from the same mathematical postulates, the 
differences depending on the system of logic used to make the deductions. 
The classical logic, like the Euclidean geometry, is the simplest and the 
most commonly used i n math ematics, science and daily life. In this book 
we shall find the space for only brief indications of other kinds of logic. 0 

Thus far we have assumed about each prime formula or atom only that 
it can be identified ; i.e. that each time it occurs it can be recognized as the 
same, and as different from other atoms. 

Now we make one further assumption about the atoms, which is charac- 
teristic of classical logic. We assume that each atom (or the proposition 
it expresses) is either t rue or false but not both. 

We are not assuming that we know of each atom whether it is true or 
false. That knowledge would require us to look into the constitution of 
the atoms, or to consider facts to which they allude under an agreed 
interpretation of the words or symbols, none of which is within our 
purview in the propositional calculus. 

Our assumption is thus that, for each atom, there are exactly two possi- 
bilities: it may be true, it may be false. 

The question now arises: How does the truth or falsity (truth value) 
of a composite formula or molecule depend upon the truth value(s) of its 
component prime formula(s) or atom(s)? This will be determined by re- 
peated use of five definitions, given by the following tables. These tables 
relate the truth value of each molecule to the truth value(s) of its immediate 

9 Some other kinds of logic would require other propositional connectives than the 
five vve introduced in § 1, e.g. □ and 0 end § 12. 
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component(s). In the left-hand columns we list all the possible assignments 
of truth t and falsity f to the immediate component(s). Then in the line 
(or row) for a given assignment, we show the resulting truth value of each 
molecule in the column headed by that molecule. 


A 

B 

A ~ B 

A 3 B 

A & B 

AV B 

A 

~iA 

t 

t 

t 

t 

t 

t 

t 

f 

t 

f 

f 

f 

f 

f 

f j 

t 

f 

t 

f 

t 

f 

t 



f 

f 

t 

t 

f 

f 




Thus A ~ B is true exactly when A and B have the same truth value (hence 
the reading “equivalent”, i.e. “equal valued”, for ~); A 3 B is false 
exactly when A is true and B is false; A & B is true exactly when A and B 
are both true; A V B is false exactly when both are false; and -iA is true 
exactly when A is false. 

Some controversy has arisen about the name “implication”, and the 
reading “implies”, for our 3. Say A is “the moon is made of green cheese" 
and B is “2+2=5”. Then A 3 B is true under our table (because A is 
false), even though there is no connection of ideas between A and B. 
Similarly, if B is “2+2=4”, A 3 B is true (because B is true), quite apart 
from whether A bears any relationship to “2+2=4”. This is considered 
paradoxical by some writers (Lewis 1912, 1917, Lewis and Langford I932). 10 

In modern mathematics the name “multiplication” is often used for 
various mathematical operations that behave more or less analogously to 
the arithmetical one called “multiplication”. Similarly, we find it con- 
venient to use the name “implication” to designate the operation defined 
by the second truth table above; and then in our logical discussions we 
usually read ADBas “A implies B”, even though “if A then B” or “A 
only if B” probably renders the meaning better in everyday English. This 
“implication”, and the present “equivalence”, are called more specifically 
“material implication” and “material equivalence”. 11 

It is, of course, possible to be interested in other senses of “implication”; 
but then one must have recourse to ways of defining it other than by a 

10 A date appearing in conjunction with a person’s name ordinarily constitutes a 
reference to the bibliography at the end of the book. The few exceptions are dates of old 
and well-known works not primarily in logic and dates not associated with publication. 

11 In everyday English, "... if . . . then” functions grammatically as a conjunction 
like “and” and “or” (connecting sentences), while “implies” is a transitive verb 
(connecting nouns). From this standpoint, when we use “if A then B” and “A implies B” 
interchangeably, we can regard the latter as short for “ “A” implies “B” ” or “that A 
implies that B”. 
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“two-valued truth table”. Our definition is the only reasonable one with 
such a table. 12 

A related question is why w'e should want to assert a material implication 
A D B, when if A is true we could more simply assert B (or if our hearers 
don’t know that A is true, we could more informatively assert A & B), and 
if A is false we could more simply say nothing. But we ordinarily assert 
sentences of the form “If A, then B” when we don’t know whether A is 
true or not. For example, before an election 1 might say [1] “If our can- 
didate for President carries the state by 500,000, then our man for the 
Senate will also win”. This form of statement enables me to predict what 
will happen in one eventuality without attempting to say more. If it turns 
out that our candidate for President doesn’t carry the state by 500,000, my 
prediction will not have been proved false. Since we are committed here 
to a two-valued logic, my statement should then be regarded as true, 
though perhaps uninteresting. If when I speak, returns are in showing our 
candidate ahead by a safe 500,000, I would more likely say [la] “Our 
man for the Senate will win” or [lb] “Since our candidate for President 
is carrying the state by 500,000, our man for the Senate will win”. But [1] 
would not have become false, just partially redundant and thus unnatural 
for me to say (unless 1 haven’t heard the latest election returns). 

To take an analogous mathematical example, suppose a positive integer 
n > 1 is written on a piece of paper in your pocket, and I do not know 
what the integer is. 1 could truthfully say [2] “If n is odd, then x n + y n 
can be factored”. By saying this, I am claiming that, when you produce 
the paper showing the value of n, then 1 will be able to factor x n + y n for 
the n you produce if it turns out to be odd (and 1 am making no claim 
about the factorability of z" + y n in the contrary case). Thus, if you have 
bet that I am wrong, to settle the bet you produce the number n. If for 
example it is 3, I then show you the factorization (x + y)(x 2 — xy + y 2 ), 
and you pay me. If for example it is 4 (or 6), I wiif automatically. 

These examples should make it clear that material implication AOB 
(“If A, then B”) is a useful and natural form of expression. 13 Similar remarks 
apply to material equivalence A ^ B. 

12 Then ordinary usage certainly requires “If A, then B” to be true when A and B 
are both true, and to be false when A is true but B is false. So only our choice of t in 
the third and fourth lines can be questioned. But if we changed t to f in both these lines, 
we would simply get a synonym for & ; in the third line only, for If we changed t tof 
in the fourth line only, we would lose the useful property of our implication that “If A, 
then B“ and “If not B, then not A” are true under exactly the same circumstances 
(which will appear later as *12a in Theorem 2). 

“ We are talking about the use of “If. . . , then . . .” with verbs in the indicative 
mood. Grammar allows also contrary-to-fact conditionals with verbs in the subjunctive 
mood. These are "If A, then B” sentences where the falsity of A (which is indicated) 
does not make the whole true irrespective of what B is. Say the n > 1 in your pocket 
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Likewise, when we don’t know whether A is true or not, and don’t 
know whether B is true or not, it can be useful to assert “A or B” or in 
symbols A V B, If we already knew that A is true, it would be simpler 
and more informative to say “A”; etc. Our disjunction A V B, defined by 
the fourth truth table, is the “inclusive disjunction” or “nonexclusive 
disjunction”, which is true when A is true or B is true or both A and B 
are true. This is more useful to us than the “exclusive disjunction”, 
expressed by the words “A or B but not both”, which instead has f also 
in the first row. While English is ambiguous, Latin is clear, using “vel” 
for the inclusive disjunction and “aut” for the exclusive disjunction. The 
symbol V comes from the first letter of “vel”. 

We postpone further discussion of the relation between our symbols 
and ordinary language to the end of the chapter. 

We now illustrate the repeated use of the above tables by computing 
the truth table for PD(QVR3(R3-iP)). The result is shown first 
as (1) below, then the details of the computation of the third line (or row). 
For this line, we first substitute for the atoms P, Q, R the respective values 
t, f, t assigned to them for that line. Then we compute the values of inner- 
most composite parts repeatedly. Thus by the table for V, fV t is t; by 
the table for ~i, -it is f; and by the table for 3, t 3 f is f (which we use 
three times). The successive stages in this computation are shown in suc- 
cessive lines for clarity, and are then summarized in a single line. 

Completed truth table: 


( 1 ) 



P 

Q 

R 

P 3 (Q V R 3 (R 3 -mP)) 

1. 

t 

t 

t 

f - 

2. 

t 

t 

f 

t 

3. 

t 

f 

t 

f - 

4. 

t 

f 

f 

t 

5. 

f 

t 

t 

t • 

6. 

f 

t 

f 

t 

7. 

f 

f 

t 

t 

8. 

f 

f 

f 

t 


is already known to me and is 4. I could say truthfully [2'] “If the n had been odd, I 
could have factored x " + y n ”; but I could not say truthfully [2"] “If the n had been odd, 
I could have factored x n+1 + y n + l ”. (With n odd, x" +1 + y n+1 may or may not be 
factorable.) A contrary-to-fact conditional “If A, then B” makes an assertion about a 
"hypothetical situation” analogous to the actual situation but differing from it by A 
holding. Sunday morning quarterbacks and Wednesday morning politicians find them 
useful. We gave a mathematical example, because we can be positive in affirming [2'] 
and refraining from affirming [2'], whereas in football and politics matters are more 
controversial. 
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Computation of the third line: 

P 3 (Q V R 3 (R 3 -iP» 
t 3(f V t 3(t => -it)) 
t =3 ( t 3 ( t 3 f )) 

t=>( t 3 f ) 

t= f 
f 


t f f t t f t f f t 

The computation process illustrated just now constitutes a mechanical 
procedure by which we can compute the truth table for any formula E, or 
more specifically the truth table for E using (or “entered from”) a given 
list P,, . . . , P„ of the prime components of E. (In (1), we used the list 
P, Q, R; we could have used instead Q, P, R or Q, R, P, etc. to obtain 
different tabulations of the same collection of eight computation results.) 
In the trivial case that E is a prime formula P, the computation takes zero 
steps, and the value column is identical with the column of assignments 
to P. 

In practice it is not always necessary to apply the procedure in full 
detail. Thus the observation that A 3 B is t whenever A is f (irrespective 
of the truth value of B) suffices to justify our entering t in the last four 
lines of the above table without further ado. 

There are formulas for which the value column in the truth table will 
contain only t’s, for example P & — iP 3 (Q V R 3 (R 3 — iP)), 

P 3 — iP ~ — iP and P 3 P, as the reader may verify (Exercise 2.2). The 
order of listing the prime components does not matter here. (Why?) 
Such formulas are therefore always true, regardless of the truth or falsity 
of their prime components. Without knowing the truth values of the prime 
components, we can nevertheless say that the composite formula is true. 
Such formulas are said to be valid, or to be identically true , or (after 
Wittgenstein 1921) to be tautologies (in, or of, the propositional calculus). 

To give a verbal example, the proposition “If I am going too fast, 
then I am going too fast” is true on the basis of the propositional calculus; 
indeed, it has the form P 3 P. But the proposition “I am going too fast”, 
if true, is true on other grounds. 

It might seem tha t th e valid formulas or tautol ogies are_tjje least 
interesting, because from one point of view they give~no~ information. 
My admitting that “If I am going too fast, then I am going too fast” 
can hardly give any of you much satisfaction. But it will appear as we 
proceed that the tautologies are important. 
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Exercises. 2.1. Find the truth tables: (a) -iP V Q. (Compare this 
with the table for P=>Q.) (b) (iP V Q)&(RD(P~ Q)). 

(c) QDPVQ. Are any of these formulas valid? 

2.2. Verify that P & ~ iP D(QVR3(R3 ~iP)), P => “iP ~ “iP and 
P D P are valid. 

2.3. Show that the following formulas are valid. To reduce the work, 
observe that the whole implication A 3 B fails to be valid only if you can 
pick truth values for P, Q (or for P, Q, R, S) which simultaneously make 
B take the value f and A the value t. Consider all the choices of values that 
make B f, and verify that none of them make A t. 

(a) ((P D Q) D P) D P. (Peirce's law, 1885.) 

(b) ((P 3R)&(Q3 S)) & (~ iR V — ]S) 33 — iP V — iQ. 

(c) (PDQ)3 (-iQ 3 -iP). 14 

2.4. Show the following not valid by computing just one suitable 
line of the table: (a) P V Q 3 P & Q. (b) (P D Q) 3 (Q 3 P).’ 4 

2.5. Find formulas composed from P, Q, R whose truth tables have 
the following value columns: (a) fffftfff. (Use a method applicable 
to any truth table with just one t.) (b) tffftftf. (First use a method 
applicable to any table with more than one t. Can you find a shorter 
formula with the same table?) (c) fffftfff* 

§3. Model theory: the substitution rule, a collection of valid 
formulas. The defi nition of validity provides us with an automatic _ way 
of decidin g as to the validity of any fo rmula: simply compute its truth 
table, and see whether we get all t’s. This is a very fortunate situation, 
and one should not hesitate to do this in any case of doubt. 

However, computing truth tables of formulas at random would be a 
rather slow way of discovering valid formulas. Anyone not familiar with 
simple examples of valid formulas and with methods for proceeding to 
others (whether or not he has officially studied logic) would properly be 
described as sluggish in his mental processes. 

One simple principle is this. In defining validity, we use a truth table 
entered from the prime components, so as to take into account all the 
structure of the formula available to the propositional calculus. However, 
to establish validity, we may not need to dissect a formula all the way 
down to its prime components or atoms. If we get all t’s in a table entered 
from (values of) components not necessarily prime, we can be sure it is 
valid. For example, P & — iP Z) P & ~ iP is of the form A Z3 A; Table (a) 
(below) entered from A gives all t’s; hence the formula is valid. For, in 
computing each line of Table (b) entered from P (as is called for under 
the definition of validity), the first part of the computation consists in 


11 — |Q 3 — iP is the contrapositive of P 3 Q, and Q 3 P is the converse of P 3 Q. 
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computing the value of P & — iP, i.e. of the A. Then the rest of the com- 
putation consists in computing the value of the whole from that of A 
(shown underlined in Table (b)>; but this we have already done with result 
t in computing Line 2 of Table (a). Table (a) is the same as Table (c) 



(a) 


(b) 


(C) 

A 

A 3 A 

P 

P & — 1 P 3 P&-1P 

P 

P 3 P 

t 

t t t 

1 

f 

t f f t t t f f t 

t 

t t t 

f 

f t f 

f 

f f t f t f f t f 

f 

f t f 


except for the notation; instead of saying we construct a table for A 3 A 
entered from A, it comes to the same thing to say that we verify the 
validity of P 3 P, and then substitute A (i.e. P & “iP) for P in P 3 P. 
This reasoning gives the following theorem, in which we write “k E” 
as a short way of saying “E is valid”. 15 

Theorem 1. (Substitution for atoms.) Let E he a formula containing 
only the atoms P t , . . . , P„, and let E* come from E by substituting 
formulas A,, . . . , A„ simultaneously for P 1( . . . , P n , respectively. If\= E, 
then f E*. 

On the other hand, to show by truth tables that a formula is not valid, 
the tables must in general be entered from the prime components. For 
example, P&~iP3Q is of the form A 3 B. The table for A3B 
entered from A and B (§ 2) does not have all t’s (in other words, P 3 Q 
is not valid). But P & ~iP 3 Q is valid. This example shows that the 
converse of Theorem I, namely “Iff E*, then t= E”, does not hold. 

Returning to the example preceding Theorem 1, since Table (a) entered 
from A has all t’s (or equivalently (c) has all t’s), we shall have all t's for 
every formula of the form A 3 A, not just the particular one we took 
there with P & ~iP as the A. This is included in the theorem; for, with 
E fixed and “1= E” established, we can apply the theorem with any choice 
of A lt . . . , A„. 

n Expressions containing “k” ("k E” here and “A„ . . . , A„ k B” in § 7) are not 
formulas of the object language, but expressions of the observer’s language, used in 
writing concisely certain statements about formulas. The definition of “formula” for 
the propositional calculus was concluded in § 1, and allows only 3, &, V, — l (as 
symbols of the object language) to be used in building up formulas from the atoms P, 

q, R p„ p., P a , . . .. Now “f ” is a symbol of the observer’s language, and hence 

stands outside every formula, and outranks — , O, &, V, — i; thus “|= A ~B” means 
“k (A ~ B)” rather than “(k A) ~ B”. 
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We use this principle to establish in the next theorem a collection of 
forms of valid formulas. 16 

For example, as *1 we give the result just established. 

As 5b, we claim that, for each choice of formulas (built up from P, Q, 
R, . . . , P x , P 2 , P 3 , . . .) as the A and B, the resulting formula BDAVB 
is valid. For, in Exercise 2.1 (c) we saw that Q 3 P V Q is valid; and 
hence by Theorem 1 B 3 A V B is valid. 

In the same way, every one of the results in Theorem 2 can be proved 
automatically, by first verifying the validity of the particular formula 
which has P, Q, R in place of A, B, C, and then using Theorem 1 (or 
equivalently, by constructing a truth table entered from A, B, C). 

The student may accordingly take the whole list on faith, as he does a 
table of square roots or trigonometric functions or integrals. 

We intend that the student should acquire the ability to use these results, 
and indeed should learn enough of them so that he can operate without 
Theorem 2 before him. However, we do not ask him to learn the list 
outright now, but rather to use it for reference and in doing so to become 
familiar with the results most often used. 17 


Theorem 2. For any choice of formulas A, B, C : 

la. 

k A 3 (B 3 A). 


lb. 

k (A 3 B) 3 ((A 3 (B 3 C)) 3 (A 3 C)). 

3. 

k A 3 (B 3 A & B). 

4a. k A & B 3 A. 



4b. k A & B 3 B. 

5a. 

k A 3 A V B. 

6 . k (A 3 C) 3 ((B 3 C) 3 

5b. 

k B 3 A V B. 

(A V B 3 C)). 


i» Most of these results have equivalents in IM (= Kleene “Introduction to Meta- 
mathematics” 1952b). With a few exceptions, we employ the numbering of IM. This 
will facilitate using IM as a reference work supplementing the present book, or the 
present book as an introduction to IM. (This accounts for the gaps and other irregu- 
larities in the numbering in Theorem 2. The present 9a, 10a, 10b, *4a, *12a, *55c, *63a 
do not correspond to like-numbered results in IM; and *55a, *55b correspond to *63, 
*62 of IM, but are renumbered here to come earlier.) 

The meaning of on 8, *12a, etc. will be explained at the end of § 12. 

17 The developments below should assist the reader in becoming familiar with 
these and other results. We shall make various applications, and establish interconnec- 
tions, which should help to fix them in mind. For some of them we shall give new proofs 
to make the resuits more meaningful. 

Following Church 1956 p. 73, *49 may be called more specifically the “complete 
law of double negation”; 8 the “law of double negation” simply; and the converse of 8 
the “converse law of double negation”. 14 Similarly, *12a is the "complete law of contra- 
position”; with ~ replaced by 3, the “law of contraposition”; by c, the “converse law 
of contraposition”. Also, la is the “law of affirmation of the consequent” (cf, *10a); 
and 7 is the “law of reductio ad absurdum”. With ~ replaced by 3, *4a is “importa- 
tion”; by c, “exportation". (When replacing ~ here, supply parentheses.) 
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7. MA 3 B) 3 8°. b —>~iA 3 A. 

((A 3 -iB) 3 ~iA). 

9a. I(A3B)D 10a. b (A ~ B) 3 (A 3 B). 

((BDA)D(A~B)). 10b. b (A ~ B) 3 (B 3 A), 

(introductions and eliminations of logical symbols.) 

*1. b A 3 A. *2. t(A3B)D((B3C)3 

(A 3C)). 

*3. b A 3 (I! 3 C) ~ *4a. tAD(BDC)^ 

B3(A3C). A & B 3 C. 

(Principle of identity, chain inference, 
interchange of premises, importation and exportation.) 

*I0a. !-- -iA 3 (A 3 B). *12a°. 1A3B~ ~iB 3 -iA. 

(Denial of the antecedent, contraposition.) 

*19. NA~A. *20. b (A~B)~(B~A). 

*21. MA~B)&(B~C)3(A^C). 


(Reflexive, symmetric and transitive properties of equivalence.) 


*31. b (A&-B) &C~ 

A &(B &C). 

*33. 1= A & B ~ B & A. 

*35. b A & (B V C) ~ 

(A & B) V (A & C). 
*37. b A & A ~ A. 

*39. 1= A & (A V B) ~ A. 


*32. b(AVB)VC~ 

A V (B V C). 

*34. t A V B ~ B V A. 

*36. b AV(B&C)~ 

(A V B) & (A V C). 
*38. b A ,V A ~ A. 

*40. b A V (A & B) ~ A. 


(Associative, commutative, distributive, idempotent and elimination laws.) 


*49°. b — i~iA ~ A. 

*50. b — i(A & ~iA). *51°. b A V nA. 


(Law of double negation, 

denial of contradiction, law of the excluded middle.) 

*55a. b — i(A V B) — -iA &~iB. *55b°. b -i(A & B) iA V ~iB. 

*55c°. b -i(A 3 B) ~ A & -iB. 


(De Morgan’s laws 1847, 18 
negation of an implication.) 

*56'. = AVB - -(-.A & -iB). *57 : . b A & i(-jA V mB). 

^*58°. b A 3 B ~ — i(A & nB). *59°. b A 3 B ~ ~tA V B. 

*60°. b A & B — -i(A 3 -iB). *61°. bAVB~~iA3B, 

*63a. b (A ~ B) ~ (A 3 B) & (B 3 A). 

(Expressions for some connectives in terms of others.) 

-i 

18 In verbal form, these go back at least to Ockham (“Summa Logicae”, 1323-9). 
Cf. Lukasiewicz 1934, Bocheriski 1956. 
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Exercises. 3.1. Redo the illustration preceding Theorem 1 (with 
Tables (a), (b), (c)) to show that P V ~ iQ 3 P V ~ iQ is valid (i.e. taking 
P V -iQ instead of P & -iP as the A). 

3.2. Establish la, 4a, 6, 7, *50, *51 by the automatic method (indi- 
cated above), except using shortcuts when you can in the truth table 
computations. 

3.3. Show that, if a table for a formula entered from components not 
necessarily prime has all f’s, then the formula is not valid. (Cf. the first 
remark following Theorem 1.) 

§4. Model theory: implication and equivalence. Suppose the 
truth table for a formula E is constructed as in § 2 by using exactly its 
prime components Pj, .... P„, and suppose that a new table is con- 
structed for E using additional atoms P n+I , . . . , P M+m not in E. Then 
the new table differs from the original table only in that the value column 
of the new table splits into 2 m parts, corresponding to the 2 m assignments 
of t’s and f’s to the atoms P„ +1 , .... P n+m which do not occur in E. Each 
of these 2"‘ parts is a duplicate of the value column of the original table, 
since the same computation (based only on the assignments to P 1( . . . , P„) 
is used in each part. For example with n — 2 and m = 1, Tables (e), (f), (g) 
below have been constructed by entering from three atoms, although the 
formula at the head of each of those tables contains just two atoms. 



Pi 

P 2 

Pa 

(d) 

(P 1 VP 2 )&(P 1 DP 3 ) 

(e) 

p 2 vp 3 

(0 

p 1 &p 3 

(g) 

p 2 =>p 2 vp 3 

1. 

t 

t 

t 

t 

t 

t 

t 

2. 

t 

t 

f 

f 

f 

f 

t 

3. 

t 

f 

t 

t 

t 

t 

t 

4. 

t 

f 

f 

f 

f 

f 

t 

5. 

f 

t 

t 

t 

t 

f 

t 

6. 

f 

t 

f 

t 

t 

f 

t 

7. 

f 

f 

t 

f 

t 

f 

t 

8. 

f 

f 

f 

f 

f 

f 

t 


In Tables (e) and (g), Lines 5-8 (Pj is f) are duplicates respectively of 
Lines 1-4 (P, is t); and in Table (f), Lines 3, 4, 7, 8 (P 2 is f) are duplicates 
respectively of Lines 1, 2, 5, 6 (P a is t). 

In particular, if the table for a formula E entered from only its prime 
components contains only t’s, then so does the table for E entered using 
given additional atoms; and conversely. (This is illustrated by Table (g).) 
Thus, L E if and only if the table for E entered from any particular list 
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I 5 ,, . . . , P„ (containing at least all the prime components of E) has 
only t’s. 

In Theorems 3 and 4, \vc shall compare truth tables for A and B (also 
for A 3 B or A ~ B). To make this easy, we shall enter each table from 
one list of atoms P„ . . . , P„, including all that occur in either ot A and B. 
So if A and B do not contain the same atoms, the table for A or for B is 
entered from more atoms than occur in it. By the preceding discussion, 
it will make no difference if the list Pi, ... , P„ contains still more atoms. 

I THEOREM 3. If t= A and f A3B, then t B. 

Proof. Consider any assignment of t’s and f’s to a list P 1( . . . , P„ of 
atoms as described. The computation of the corresponding value of A 3 B 
consists in first computing the values of A and B, and thence computing 
the value of A 3 B by the table for 3 (beginning of § 2). By the hypotheses 
that F A and F A 3 B, both the value obtained for A and the final value for 
A 3 B are t. From the table for 3, this can only be the case when Line 1 
of that table applies, and in Line I of that table B is also t. Since this is the 
case for each assignment to P,, . . . , P,„ the formula B receives the value t 
for all assignments, i.e. F B, as was to be shown. 

Theorem 4. (a) For each assignment, A ~ B is t if and only if A and B 
have the same truth value. Hence: (b)° 1= A~B if and only if A and B 
have the same truth table. 

Proof. Consider any formulas A and B. (a) In the computation of 
the value of A ~ B for a given assignment of t’s and f’s to Pj, . . . , P,„ the 
first part consists in computing values of A and B, after which the computa- 
tion is concluded by entering the basic table for A B in § 2 with the 
resulting values of A and B. From that table we see that A ' ^ B is t if and 
only if the values computed for A and B are the,. same, (b) So the table 
for our A ~ B has all t’s, exactly if, for every assignment, A and B have 
the same value. 

Example 2°. By (b) of the theorem with the result of Exercise 2.1 (a), 
r P 3 Q ~ mP V Q (and F -iP V Q ~ P 3 Q). Thence by substitution 
(Theorem 1), F A3B~-iAVB (and F ~rA V B ~ A 3 B). Thus we 
reprove *59 of Theorem 2. (This proof differs from the one suggested in 
§ 3 only in that we now take into account the general principle stated as 
Theorem 4 (b), instead of (like robots) separately completing the computa- 
tion of P 3 Q ~ -iP V Q in each line). 

Theorem 5. (Replacementthcorem.) Let C A be a formula containing a 
formula A as a specified ( consecutive ) part, and let C| t come from C A by 
replacing that part by a formula B. If F A ~ B, then F C A ■ — C l5 . 
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Proof. Assume 1= A ~ B. Then by Theorem 4 (b), A and B have 
the same table. Hence if, in the computation of a given line of the table 
for C A , we replace the computation of the specified part A by a computa- 
tion of B instead, the outcome will be unchanged. Thus C IS has the same 
table as C A ; so by Theorem 4 (b), t= C A ~ C n . 

Example 3°. From Example 2 by Theorem 5, 

f -iP V Q D (P 3 iP VQ ) ~-iPVQD(P3(P3 Q)). 

The part A of C A is underlined. In writing C n , a pair of parentheses is 
required which was unnecessary in C A . 

By a “consecutive” formula part A of C A we are understanding a for- 
mula part A which is consecutive before parentheses are omitted, and 
whose value is thus computed in the course of computing the value of the 
whole C A . Thus P V Q does not occur as a consecutive part of 
-iP V Q 3 (P 3 -iP V Q), as becomes clear upon restoring some paren- 
theses : (-iP)VQD(PD iP) V Q). 

Corollary. (Replacement rule, or replacement property of equiva- 
lence.) IfY C A and 1= A ~ B, then 1= C B . 

Proof. By the hypothesis that t= A~B with the theorem, I C A ~ Cjj. 
So by Theorem 4 (b), C A and have the same table. By the hypothesis 
that 1= C A , this table has all t’s. 

Exfrcisfs. 4.1. In the manner of Example 2, reprove *31, *34, *49, 
*55a, *55c of Theorem 2. 

4.2°. Similarly establish that: 

(a) h (A ~ B) ~ (A & B) V (lA & -iB). 

(b) t= — i(A ~ B) ~ (A & iB) V (— iA & B). 

4.3. Illustrate the proof of Theorem 5 by computing the second line 
(for t f assigned to P Q) of the tables for — iP VQD(PD ”iP V Q) and 
— iP VQD(PD(P3 Q)). Underline the common parts (as in Tables 
(a), (b)). 

4.4°. Use Theorem 5 with *55a in Theorem 2 to establish that 
t= — i — i( — iA V -iB) ~ — 1 ( — i — i A & — i — jB). (Observe that, whatever for- 
mulas constructed from P, Q, R, . . . , P a , P 2 , P 37 • • • “A” and “B” stand 
for here, *55a will hold when its A and B are the present -iA and -iB.) 

4.5. Using h ->A VB~A3B (Example 2), infer *10a from 5a. 

4.6. Give three proofs that: If 1= A and N A ^ B, then N B. (By 
Theorem 4 (b); by Corollary Theorem 5; using 10a and Theorem 3.) 

4.7. Show by an example that Corollary Theorem 5 does not hold with 
“3” in place of 
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4.8. Establish the following propositions, where A is a formula 
containing no occurrence of the symbol ~i, and B is any formula. 

(a) The truth table of A has t in its first line. 

(b) if f — i B, then B contains at least one occurrence of ~ i. 

(c) If L B — ' — iA, then B contains at least one ~i. 

§ 5. Model theory: chains of eq uivale nces. It is often useful to 
know that two formulas A and B have the same truth table, or to trans- 
form a given formula A into a formula B of some specified sort which has 
the same table. By Theorem 4 (b), A and B have the same table exactly 
when b A — B, i.e. when the formula asserting the (material) equivalence 
of A and B is valid. In this case, we may say that A and B are ( logically ) 
equivalent (in the propositional calculus). Of the 45 results in Theorem 2, 
26 are thus assertions of equivalences holding in the propositional 
calculus. 

The chain method which we present next is useful in establishing such 
equivalences. 

First note that: (a) 1= A & B if and only if both 1= A and 1= B. This is 
immediate from the truth table for & (or we can infer it by Theorem 3 
from 3, 4a and 4b in Theorem 2). 

Next we observe that equivalence in the propositional calculus is 
reflexive, symmetric and transitive: ((3) b A ~ A. (y) If b A ~ B, then 
b B A. (8) If b A ~ B and b B ~ C, then !. A ~ C. These three 
statements are immediate from Theorem 4 (b). (Alternatively, ((3) is 
*19; (y) follows from *20 by Exercise 4.6; and (8) from *21 using (a) and 
Theorem 3.) 

Using ((3)-(8): (e) If V A 0 ~ A x and b A t ~ A 2 and 1= A Z ~A 3 , then 
b A, ~ A , for each of the 16 pairs of subscripts i, j (/, j — 0, 1, 2, 3); 
i.e. b A 0 ~ A„, b A 0 ~ A I; b A 0 ~ A 2 , b A 0 ~ A 3 , b A : ~ A 0 , . . . , 
b Aj~ A 0 , b A 3 - ■ Aj, b A 3 ~ A,, b A s ~ A 3 . Thus ((3) gives b A 0 ~ A„; 
to get b A 3 ~ A x , we use b Aj ~ A 2 and b A,~A 3 with (8), and the result 
with (y); etc. Or (e) can be recognized as true directly from Theorem 4 (b); 
for, the three hypotheses of (e) say that in the list A 0 , Aj, A 2 , A 3 each suc- 
cessive formula has the same table as the preceding, and the conclusion says 
that any pair of formulas in the list have the same table. 

Now we adopt “A„ ~ Aj ~ A 2 ~ A 3 ” as an abbreviation for 
((A„ ~ A,) & (Aj - — - A 2 )) & (A, ~ A 3 ). Then by two applications of 
(7.): (£). The hypothesis of (e) is equivalent to b A 0 ~ A t < — > A 2 < — - A 3 . 

We call A„ ~ Aj ~ A 2 ~ A 3 a “ch ain qf_ (thre eX_£quivajences”. It 
has the properties that we can establish its validity by establishing the 
validity of the (three) “links”, and that once the chain is established as valid 
we can infer the equivalence of any pair of the formulas A 0 , A lf A 2 , A 3 
(joined by links) in the chain. 
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Everything beginning with (e) said using A 0 , A„ A.,, A 3 applies similarly 

to A 0 A n for any n ;> 2 (and trivially even for n = 0, 1). 

Now we remark that, once *49, *55a and *55c in Theorem 2 are 
established (as proposed in §3, or by Exercise 4.1), then all of *55b, 
*56— *6 1 follow by the chain method. For example: 

*55b. By *49 (since the A of *49 can be any formula, e.g. the present 
- 1 AV- 1 B) and (y): (1) 1= ~iA V ~iB ~ — i— )(— iA V ~iB). By Theorem 5 
with *55a (as in Exercise 4.4): 

(2) N — i-i(— iA V ~iB) ~ — i(-|— i iA & — i — iB). By Theorem 5 with *49: 

(3) t= — 1( — i — iA & -i-i B) ~ -i(A & — i — jB), 

(4) h “i(A & — i — iB) ~ ~i(A & B). From (1)— (4) by (f) (for n — 4), • 

f -i(A & B) ~ — iA V ~iB, as was to be shown. — Using (Q, we can put 
this proof in the following shorthand: 

1= -iA V “iB ~ — i — i( — iA V ~ iB) [*49] ~ — 1 ( — i — iA & tiB) [*55a] ~ 

-i(A & B) [*49], 

*57. t= A & B ~ — i— i(A & B) [*49] ~ -i(-iA V -iB) [*55b]. 

*60. I=A&B~A & — i — iB [*49] ~ -i(A 3 -iB) [*55c]. 

Since in § 3 we could accept all the results of Theorem 2 as established 
by someone else’s computation, the real point of these reproofs is to 
make it evident how, if we remember *49, and any two of *55a-*61 
which together contain all three of the symbols 3, &, V, we can quickly 
derive the others. 

Now we use the chain method to get a new result. 

Theorem 6°. Let E be any formula constructed from atoms P 1( . . . , P„ 
and their negations — iPj, . . . , — iP„ using only & and V. Let E f come from 
E by interchanging & with V and each unnegated atom with its negation 
(cf. the example in the proof). 19 Then t= — lE ~ E t . 

Proof. Using *55a and *55b (with the chain method), we can move 
the initial ~i of “lE progressively to the right (inward) across all the &’s 
and V’s, which interchanges them. Then we can use *49 to remove the 
resulting double negations, so that the atoms will be interchanged with 
their negations. The following example illustrates this proof. 

“fliiuit <t — 1( — iQ & ( -)P V Q)) 

E ~ — i— iQ V “ i( “iP V Q) [*55b] 

~ — l — iQ V ( — i — iP & -tQ) [*55a] 

~ QV ( P & — iQ) [*49], 

Corollary 0 . Each formula E is equivalent to a formula F (i.e. 
f E ~ F) in which occurs only applied directly to atoms. 

19 If in writing E parentheses were omitted under our ranking of & ahead of V 
(cf. § 1), they should be restored before performing the operation t here, or the operation 
' in Theorem 7. 
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Prool. First, we can eliminate ~ and 3 from E by *63a, and *58 
nr *59 (or possibly *55c, *60 or *61). Next, we can suppress any double 
negations by *49. Finally, Theorem 6 can be used to eliminate successively 
each -i which does not apply directly to an atom; in doing so, we work 
each time on such a ~i that is innermost (i.e. docs not have another such 
-i within its scope. Example I). This should become clear from the 
following illustration. 

F — 1{— iP 3 — 1(— i— iP V iQ) & R} 

~ “iJ-i-iP V ( t( t tP V -i Q) & R)} [eliminating 3 by *59] 

~~ ~h'P V ( if P V -iQ) & R)} [suppressing the double negations by *49] 
~ ~i{P V ( iP & Q & R)} [applying Theorem 6 to the part — t(P V — iQ)] 
~iP & (P V — iQ V iR) [applying Theorem 6 to the whole]. 

As we eliminate the 3, we supply a pair of parentheses, which before were 
superfluous since 3 outranks &. In the fourth and final formulas, we omit 
a pair of parentheses (as mathematicians do in v/riting “a+b+c”), since 
by *31 and *32 it is immaterial for present purposes which way the triple 
conjunction and disjunction are associated. 20 

Exercises. 5.1. Use the chain method to derive *56, *58, *59, *61 
(taking *49, *55a-*55c as already established). 

5.2. Find equivalent formulas with -> applied only to atoms: 

(a) — i((P & ~ iQ) V R V (S & — iP)). (b) -i(P V -iQ 3 (R & — I — iS) V Q). 

(c) — 1 (— j(P & Q) ~ P). 

5.3°. Establish the following, using as far as possible recent results 
rather than new direct appeals to truth tables: 

(a) F (A — li) --(AD B) & (B 3 A) ~ (~iA V B) & (A V ~iB) ~ 

(A & B) V ( i A & -iB). (b) F — i(A ~ B) ~ (A & — iB) V (— iA & B) ~ 

(A V B) & (— iA V -iB) ~ (A V B) & ~i(A & B) [which expresses 

exclusive disjunction] ~ (— iA 3 B) & (A D — iB) ~ 

(iB 3 A) & ( B D — i A) ~ (— i A ~ B) ~ (A ~ — iB). 

*§ 6. Model theory: duality. 21 Theorem 7°. (Duality.) Let E and 
F be formulas of the type described in Theorem 6. Let IT, F :/ come from 
E, F by interchanging & with V.‘' J Then'. 

(a) If F — iE, then F E'. (b) //F E, then F -iE'. 

(c) //N E ~ F, then F E' ~ F'. (d) //F E 3 F, then F F' 3 E'. 

20 Although by *31 the association is ordinarily immaterial to us, we can for definite- 
ness regard A t & . . . & A,„” for m > 3 as an abbreviation for 
(. . . ((A! & An) & A ;) ) ... & A„_ ,) & A,„. For m = 1, “A x & ... & A m ” means simply 
A,. Similar!) with *32 and V. (For in = 0, cf. Footnote 248.) 

Starred sections can be omitted without loss of continuity (references from un- 
starred sections will be incidental). This § 6 is not required for later starred sections 
(except for the end of § 19). 
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Proof, (a) Assume F ~iE. By Theorem 6 with Corollary Theorem 5 
(or Exercise 4.6), F E f . Thence by Theorem 1, F E f * where * indicates 

the substitution of — iP,, . - - , ~ iP„ simultaneously for the atoms Pj 

P„. Finally, by *49 with Corollary Theorem 5, F E T * J where * indicates 
the removal of a double negation before each prime part which in E was 
unnegated. But E f * 1 is E', as the following example illustrates. 


F -iE. 

F -i( P & -1 P). 

t= E f . 

F “I PV P. 

F E f *. 

F — 1— iP V “iP . 

F E^, i.e. F E'. 

F P V nP . 


(b) Assume F E. Then by *49 with Corollary Theorem 5, F thE. So 
by Theorem 6 and Corollary Theorem 5, 1= ~iE f . Thence 1= “iE t *. Thence 
F -lE 1 **, i.e. F ~iE'. 

(c) Assume F E ~ F. Then by Theorem 5, 1= “iE lF. So by Theorem 6, 
1= E f ~ F f . Thence F E f *~ F f *. Thence tE’*^ F f *% i.e. F E'~F'. 

If we had established 4a, 4b, *31, *33, *35, *37, *39, *50 and duality 
(Theorem 7), but not yet 5a, 5b, *32, *34, *36, *38, *40, *51, the 
latter would follow by duality and substitution (Theorem 1). For example, 
by *50 with P as the A, F -i(P & -iP). Thence by duality (Theorem 7 (a)), 
t= PV -iP. Thence by substitution, t= AV “iA, which is *51. 

The effect of using Theorem 1 (substitution) with Theorem 7 is to allow 
the duality transformation to be applied to a resolution of E (or of E, F) 
into components A u . . . , A„ not necessarily prime, which must then 
retain their identity (be “treated as prime”) throughout the transforma- 
tion. (Similarly with Theorem 6 and Corollary.) — 

Suppose a visitor from Mars is confused by what he observes upon his 
arrival on Earth, and mistakes our true “t” for false “F”, and our false 
“f” for true “T”; i.e. let F = t and T = f. Then our table for & would 
for him read as our table of V for us, and vice versa. To see this, let us 
view the tables in the square arrangement (available in the case of two 
components), in which properties of the tables are more easily visualized. 
Table (I) is our table for &; (2) is the same rewritten using F = t and 
T = f; (3) is (2) rearranged to the normal order of T first and F second 

(1) (2) (3) (4) 

A & B /A&B A & B A VB 


B 

t 

f 

B 

F 

T 

B 

T 

F 

B 

t 

f 

A t 

t 

f 

A F 

F 

T 

A T 

T 

T 

A t 

t 

t 

f 

f 

f 

T 

B 

D 

F 

T 

F 

f 


D 
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(according to the Martian’s ideas). Now observe that Table (3) looks just 
like our table (4) for V, except that it is written in the capitals T, F instead 
of the small letters t, f. The table for —i written with T, F in the Martian’s 
normal order will look just like our table for ~ i written with t, f, as the 
reader may verify. 

These observations suggest new proofs of Theorems 6 and 7. 22 Also 
they suggest how to avoid excluding — > and D from the formulas E, F 
(and our restriction on ~i was inessential above). We need simply add to 
our symbolism two new propositional connectives ^ and <t, choosing 
the tables for A B and A B so that they will look to the Martian 
as the tables for ~ and D, respectively, look to us. The reader may verify 
that this is accomplished if A ^ B has the table for -i(A ~ B) and A cf B 
has the table for ~i(B 25 A). We may, if we wish, regard these as temporary 
additions to our symbolism, used while applying duality, and then elimi- 
nated by rewriting each part A^Bas ~i(A ~ B) (or A ~ ~iB, by Exercise 
5.3 (b)) and A <£ B as -i(B D A) (or B & ~iA). 

We now prove Theorem 6a° (= Theorem 6 when E may be any 
formula, even containing and c£, and + is the operation of interchanging 
~ with r*-, and 25 with Cf, and & with V, and of changing by one the 
number of -Vs on each atom). By Theorem 4 (b) it will suffice to show 
that -iE and E f have the same table, in computing (any given line for) 
— iE, we first compute a value for E (from the t’s and f’s assigned to the 
atoms Pj, . . . P„) using our tables for ~, nc, 25, tf, &, V, ~i, and then 
(by the ~i of — iE) we change the resulting t or f to T or F. In computing 

E\ we first change the t’s and f’s assigned to Pj P„ to T’s and F’s 

(by the change by one in the number of -Vs on Pj, . . . , P n ), and then 
(because of the' interchange of ~ with and 15 with Cf, and & with V) 
we do the same computation using the Martian’s tables with T, F as before 
we did using ours with t, f. Thus the two computations differ only in 
whether we change t, f to T, F at the end or at the beginning. 

We prove Theorem 7a° (= Theorem 7 similarly extended), thus. 

(a) {F — iE} = {all lines in the table for ~iE have t} = {all lines in the 
table for -iE' have T} s {all lines in the table for ~iE' have f} = {all 
lines in the table for E' have t) = {F E'}. 23 

- z The preceding treatment is basically as in IM pp. 121-124, which was inspired by 
Hilbert and Ackermann 1928 Chapter 1 § 5. The following treatment is inspired by 
Church 1956 § 16 pp. 106-108. Duality in logic was first recognized by Schroder 1877. 

23 For brevity, we are using “ = ” for “(is) equivalent (to)” or “if and only if”, with 
the chain method, in the observer’s language. We prefer the symbol “=” different from 
in order to keep clear the distinction between the two languages. 14 

In (c), we similarly use for “implies” or “only if” in the observer’s language, 
with the chain method but lacking symmetry ; thus A„ -*• A, -*■ A z -* A, implies A, — ► A, 
only for i < j. 
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(c) {N E ~ F} - {L — i(E ~ F)'} [(b)] - {1= -i(E' op F')} — 

(t= — i — i(E' ~ F')} [Theorem 4 with the table for op, and Corollary 
Theorem 5] -*• (1= E' ~ F'} [8 in Theorem 2 with Theorem 3]. 

Exercises. 6.1. (a) Prove Theorem 7 (d). (b) From the proof of Theorem 
7a (a), infer (b). (c) Prove Theorem 7a (d). 

6.2. By applying Theorem 7a (c) (with P, Q, for A, B), extend the list 
in Exercise 5.3 (a) of “equivalents” of A ~ B. 


§ 7. Model theory: valid consequence. We started this chapter by 
saying that logic has the important function of saying what follows from 
what, and thus of saying what propositions are theorems for given axioms. 
Yet thus far we have dealt only with tautologies, i.e. valid formulas, which 
logic asserts to hold without regard to any extra-logical assumptions 
whatsoever. 

Still keeping in mind that in the propositional calculus we do not look 
at the internal structure of the atoms (or will not know the propositions 
they express), let us suppose that we are given from outside the proposi- 
tional calculus that a formula A is true by assumption or fact. That is, 
we may be told that it is an axiom of s ome abstract th eory (like geometry 
or group theory), so it is true by fiat for the purpose of that theory. Or it 
may be a proposition which is true in physical fact or by intuitive mathe- 
matical reasoning. Flow does this alter our position with regard to what 
formulas we can assert to be true by use otherwise of only the propositional 
calculus? 

Consider an example; say A is (P D Q) & (P V R) (Table (h)). 


p 

Q 

R 

(h) 

(P r> Q) & (P V R) 

(0 

QVR 

(j) 

P3R 

(k) 
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Remember that what P, Q and R really are is top-secret information, and 
practitioners of the propositional calculus are not cleared for it. Never- 
theless, if we are told that (P D Q) & (P V R) is true, we have been told 
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something. Namely, we then know that the truth values of P, Q, R must 
form one of the four assignments (Lines 1, 2, 5, 7) which give t to 
(PDQ) & (P V R) in Table (h). So now, in trying -to decide what other 
formulas B are true on the basis of the propositional calculus plus the 
information that A is true, we need consider only these four assignments. 
Thus, upon being given that A is true, we know that Q V R is true because 
its table (i) has only f’s in Lines 1, 2, 5, 7; but we still do not have enough 
information to know whether P 3 R is true, because its table (j) has f in 
Line 2. 

T his leads us to the following definition. Consider two formulas A and 
B, and let Pi, ... , P„ be the atoms occurring in A or in B. We say that B 
is a valid consequence of A (in, or by, the propositional calculus), or in 
symbols A k B, if, in truth tables for A and B entered from P l5 . . . , P m 
the formula B has the value t in all those lines in which A has t. 

P Thus, as we have just observed, (P 3 Q) & (P V R) k Q V R but not 
/ (P 3 Q) & (P V R) k P 3 R. 

I We note that “A k B” is a stronger statement than “If k A, then 1= B”; 
by this we mean that the first statement always implies the second, but 
the second may hold without the first holding. 

To see that the first always implies the second, assume the first “A k B” 
and the hypothesis “k A” of the second. Then (by A k B) B has f in all 
those lines in which A has t; and (by k A) these are all lines; so k B. 

When A, B are (P 3 Q) & (P V R), P 3 R, the second statement “If 
k A, then k B" holds as a material implication (§2) since “k A” is false; 
but as we observed above “A k B“ does not hold. The point is that, when 
not k A, so it is not the case that A is t in all lines, then “If k A, then k B” 
holds automatically, while “A k B” holds only if B is t in those lines if any 
in which A is t. 

Now suppose tn formulas A 1( . . . , A„ t are given. Generalizing from the 
case m — I, we define: B is a valid consequence of A 1( . . . A,„ (in, or by, 
the propositional calculus), or in symbols A„ . . . A m k B, if, in truth 
tables entered from a list P t , . . . , P n of the atoms occurring in one or 
more of A], . . . A m , B, the formula B is t in all those lines in which 
A[, • • • , A„, are simultaneously t. The symbol “k” may be read 
“entail(s)”. 

Not only is it obviously immaterial here in what order the atoms 

occurring in any of A 1? A„„ B are listed as P 1; . . . , P„. But also by 

beginning § 4, the outcome will be the same if the tables for A 1( . . . A,„, B 
are entered front a list P,, . . . , P„ including still more atoms. 

Inspection of the tables shows that (i), (j) k (k) (Lines 1, 3, 5, 6, 7); 
Lh (j), (1) k (k) (Line 3); (h), (1) k (i) (there are no lines in w'hich it needs 
to be checked that (i) is t); but not (i), (j) k (h) (e.g. Line 3). 
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Theorem 8. (a) A (= B if and only if 1= A 3 B. (b) More generally, 
for m> 1 : Aj, . . . , A^, A m k B if and only if A„ . . . , A m _, t= A,„ 3 B. 

Proof, (a) Consider tables for A, B, A3B entered from a list 
P 1( . . . , P„ including all their atoms. Those lines in which A is f do not 
matter for whether A i= B, and in those lines A 3 B is t anyway (by the 
table for 3). So consider the remaining lines, i.e. the lines in which A is t. 
If A 1= B, then B is t in these lines; so by the table for 3, A 3 B is t in 

these lines (as well as the others); so t A 3 B. Conversely, if t= A 3 B, 

then A 3 B is t in these lines (as well as the others); so by the table for 
3, B is t in these lines (the lines for which A is t); so A 1= B. 

(b) for m > 2. Consider tables for A 1? . . . , A m , B, A 3 B. We reason 

as before, with A„, as the A, except that we confine our attention through- 
out to only lines in which A^ . . . , A,,,^ are t. 

Corollary. For m > 1 : A„ . . . , A Wi _ 1( A,„ 1= B if and only if 
N A t 3 (. . . (A m _ 1 3 (A m 3 B)) . . .). 

Proof. By m successive applications of the theorem. 

By Corollary Theorem 8, the problem of what formulas are valid con- 
sequences of given formulas A 1( . . . , A m is reduced to the problem of 
what formulas are valid. Th is is one reason why tautologies are import ant. 

One might reverse the argument and consider this as a reason why~the 
valid consequence relationship is unimportant. However, the valid con- 
sequence relationship corresponds more directly to the way we ordinarily 
use logic. Many manipulations are easier to make in terms of valid con- 
sequence relationships than when these relationships are condensed by 
Corollary Theorem 8 into the validity of i terated implicatio ns^ 

For reasons which will appear later, we prefer to emphasize these 
manipulations in another context, that of “proof theory”, which we will 
begin studying in § 9. Therefore we relegate the further development in 
terms of valid consequence to the exercises, which may help to make some 
of the manipulations more meaningful when we take them up in proof 
theory. 

Exercises. 7.1. (a) Find all the true statements “(h) 1= B” and ‘ (h), 
(j) 1= B” where B is one of (h)-(i). (Counting trivial ones like “(h) t= (h)”, 
there are six.) (b) Prove that for every formula B: (h), (1)1= B. (c) Prove 
that for every formula B: (i), (j) 1= B if and only if (k) t= B. 

7.2. Verify by truth tables: (a) P, P3Qt=Q. (b) P, QhP&Q. 
(c) P & Q 1= P. (d) P & Q 1= Q. (e) P, -iP 1= Q. (f) not P 3 Q 1= Q 3 P. 

7.3. Show that, with notation as in Theorem 1 (but with w + 1 formulas): 
If A 1; . . . , A,„ 1= B, then A*, . . . , A* 1= B*. (Hint: use Corollary Theo- 
rem 8.) 

7.4. (a) Apply Exercise 7.3 to generalize Exercise 7.2 (a)-(e) from P, 
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Q to A, B. (b) Thence by Theorem 8 and Corollary reprove 3, 4a, 4b and 
* 10a of Theorem 2. 

7.5. For ni > 1, show that: (a) A l5 . . . , A m f= B ;/ and only if 

A J&...& A,„ 1= B. 2# Thence by Theorem 8: (b) A,, . . . , A m b B // and 
only i/l A|&. . . 4 A,„ D B. (This gives us an alternative to Corollary 
Theorem 8.) 

7.6. Establish the following. 

(i) For in > ! : (ii) For m,p > 0: 

Aj, . . . , A,„ b Aj, If A It . . . , A m b Bj, 

• * • * • • • > 

An , A,„ b A,„. An ... , A,„ b B„ W B„ . . . , B„t= C, 

then A lf . . . , A,„ b C. 

7.7. Show directly from the definition of valid consequence: 

(a) If A b B W A b — iB, then b -iA. (Reductio ad absurdum.) 

(b) If A b C fl/uf B b C, then A V B b C. (Proof by cases.) 

7.8. Do Exercise 7.7 instead from Theorems 2, 8 and 3. 

7.9. Observe that the reasoning in §§ 4, 5 (for the chain method) holds 
good when we confine our attention to assignments (i.e. lines of the tables) 
for which a given list of formulas A,, . . . , A,„ are all t; so Theorem 3, 
Theorem 5 (and Corollary), and (a)-(O, and thus the chain method, hold 
good with “b” replaced throughout by “Aj, .... A,*b”. Now show that: 

(a) P ~ -iQ b P & ~iQ ~ P. (Hint: cf. Theorem 2.) 

(b) (PVQ)~ -i(P & Q) b (P V Q) V (— i(P & Q) & P) ~ (P V Q). 

*§ 8. Model theory: condensed truth tables. We used the idea of 
the truth tables to define when a formula E is valid (in symbols, b E) and 
when a formula B is a valid consequence of formulas Aj, . . . , A m (in 
symbols, A,, . . . , A„, b B). The tables themselves have been used (often 
with shortcuts, as in Exercise 2.3) in illustrations and in the original 
proofs of the results in Theorem 2 and some other results. But hereafter, 
to establish that b E or that A 1; . . . , A m b B, it will ordinarily be more 
efficient to employ theorems about validity and “valid consequence” than 
actually to compute truth tables. In §§ 4 and 5 we began, and we shall 
continue, to develop techniques for using such results systematically in 
lieu of truth tables. If we wish to show that not b E or not A„ . . . , A m b B, 
we need to compute only one suitably chosen line of the table(s). Often 
we can spot such a line with a little trial and error, without computing 
the full table(s). 

A formula E is called inconsistent or contradictory or identically false, 
if it has a solid column of fs in its truth table; contingent, if it is neither 
valid nor inconsistent. Thus formulas fall into three classes with respect 
to the presence of t’s and fs in their truth tables. 
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valid 

contingent 

inconsistent 

all t’s 

some t’s and some f’s 

all f’s 


invalid 


consistent 

A formula E is inconsistent or consistent, according as -iE is valid or 
invalid. To establish that a formula is contingent, computation of two 
suitable lines would suffice. 

If, notwithstanding, we should find we need to do much truth-table 
computation, it will be worthwhile to seek further economies in the 
writing and computing of the tables. 24 We used 8 lines in writing the 
tables for formulas with 3 atoms; with a dozen atoms, 4096 lines would 
be required similarly. 

Consider the table (1) in §2 for P D (Q V R => (R => -iP)). Since the 
value t is common to the last four lines, those lines can be replaced by one. 
Likewise, the first and third, and also the second and fourth, lines can be 
combined. Thus (1) condenses to: 


P R 

PD(QVR3(R3-iP)) 

t t 

f 

f 

t 

f 

t 


In § 2, we observed a shortcut in the computation of the table for 
P3(QVR3(R3 -iP)) which gave the t’s in the last four lines of (1) 
en masse. This shortcut is an instance of a technique that is advantageous 
generally. The technique consists in assigning a value t or f to just one of 
the letters and computing as far as we can with that, then repeating with 
another letter, etc. (instead of assigning values to all of the letters first, 
and computing second). Consider the basic table for any binary proposi- 
tional connective O. If we pick a value (t or f) for A, then, with that value 
fixed, AO B has the table of a unary connective applied to B. There are 
only four possible tables for a unary connective, with entries respectively: 
(i) t, t, (ii) t, f (the same as the values of B), (iii) f, t (the same as -iB), 
(iv) f, f. So, after picking a value of A, A O B can be evaluated as one of 

21 If the number of tables or their complexity is very great, the use of modem high- 
speed computing machines should be considered. How best to put truth-value problems 
on the machines must take into account features of the machines, and belongs in the 
area of "computer sciences”. Cf. Wang 1960. 
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t, B, — iB, f. C'onsidcring the actual cases that interest us, we obtain the 
following tables. 

B ~ A B&ABVA 

A A — B A 3 B BDA A & B A V B ~iA 

(1) 

t B B t B t f 

f — iB t “iB f B t 

Now wc use this technique on the previous example. 

P 3 (Q V R 3(R3mP)) 

t 3 tQ V R 3 (R 3 it)) f 3 (Q V R 3 (R 3 -if)) 

Q V R 3 (R 3 f) t 

(3) QVR3-iR 

Q V t 3 — it Q V f 3 “if 

t 3 f Q 3 t 

f t 

In the first line is the formula before values have been assigned to any of 
the letters. In the second line, P has been assigned t and f in the left and 
right columns, respectively. The tables in (I) are then used to simplify the 
resulting expressions to Q V R 3 -|R in the left column (by three steps) 
and to t in the right column (by one step). Continuing, in the fourth line 
of the left column, R is assigned t and f in the left and right subcolumns, 
respectively. The figure (3) is called a truth-value analysis by Quine 1950, 
who was apparently the first to emphasize the advantages of computing 
on one letter at a time. The table (2) can be read off the analysis (3) (and 
indeed Quine in 1950 never writes (2) at all). 23 

A good rule of thumb is to select each time, for assignment of t or f, 
a letter occurring frequently. This will tend to promote rapid simplification, 
as each binary combination in which it occurs must disappear. By looking 
ahead a bit, one may be able to recognize cases when a different choice 
would be better. 

Steps can be saved by using *49 (with Theorems 4, 5) to suppress double 
negations, either present initially or introduced in using (1). Thus if B is 
-iC, f ~ B simplifies by (I) to -i~iC, which further simplifies by *49 
to C. 

The method of (3) is not guaranteed to bring us directly to a most con- 
densed table, like (2). If we treat Q second (instead of R), we come out 
with a 5-linc table. 

“ The tables (I) appear in a different format as *40a-*48 in Theorem 22 § 24 below. 
(They appear as *41-*48 on p. 118 of IM, which was written without knowledge of 
Quine 1950. But their use, as on p. 474, was not emphasized in IM.) 
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The formula {P 3 (Q V R 3 (R 3 iP))} & {((Q 3 P) 3 Q) 3 Q} admits 
the table (2); but no order of treating the letters gives a 3-line 
table directly, Of course, afterwards we can combine lines, as we did to 
get (2) from (1). If we first notice that ((Q 3 P) 3 Q) 3 Q is valid 
(Exercise 2.3 (a)), we can pass to P 3 (Q V R 3 (R 3 iP)), and thence 
to (2). 

Indeed, the method of (3) will never produce a table of less than two 
lines, since it requires an assignment of t or f to one of the letters P,- to 
get started. But ((Q => P) 3 Q) 3 Q, being valid, admits the l-line table: 



Full truth tables, preferably written in condensed form, can be used in 
simplifying formulas. Suppose a complicated formula E arises in some 
problem, and we wish to investigate valid consequences of E, or more 
generally to investigate whether various relationships A u . . . , A,„ N B 
hold in which E is a specified one (or a part of one) of A t , . * , , A m , B. 
For these purposes, E can be replaced by any formula F with the same 
truth table as E (i.e. by Theorem 4, a formula F such that N E ~ F), or 
as we said in § 5 a formula F equivalent to E (in the propositional calculus). 
So if we can find a formula F equivalent to E but simpler than E, our 
investigation will be furthered. 

For example, say E is P 3 (Q V R 3 (R 3 mP)) or 
{P D (Q V R 3 (R 3 -lP))} & {((Q 3 P) 3 Q) 3 Q}, either of which has 
the truth table (2). By starting from (2), and asking what formulas 
have that truth table, we find ones simpler than E, namely (P & ~iR) V “iP 
(which is picked to have exactly the two t’s in (2)), m(P & R) (where P & R 
is picked to have exactly the one f in (2)), -iP V ~iR (thence by De 
Morgan’s law *55b), and P 3 mR (thence by *59). The formula 
(P & -iR) V -iP gives a resolution of the truth of E into two disjoint (i.e. 
non-overlapping) cases (namely: P, R are t, f ; P is f), and mP V ~iR con- 
denses this by allowing the cases to overlap. Thus we could have found 
-iP V -iR without going through -~i(P & R). 

It is worth noting that consolidation and recombination of cases can 
be effected by chains of equivalences, since facility with such transforma- 
tions is useful. Besides results in Theorem 2, we need the following. 20 

*52°. 1= A & (B V -lB) ~ A. *53. t= A V (B & ~iB) ~ A. 

*54. kA&B&-iB~B & ~lB. *55°. t=AVBV-iB~BV ~iB. 
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We give an illustration, starting with an equivalent of 
P D (Q V R D (R D — iP)) read from (!) with only the last four lines com- 
bined: 26 

f= (P & Q & — iR) V (P & -iQ & -iR) V -iP ~ 

(P & -iR & (Q V -iQ)) V ->P [*35 witli *33 (and *31)] ■ — 

CP & ~|R) v -lP [*52] ~(PV -IP) & (— >P V -|R) [*36, *34] ~ 

-lPV-iR [*52, *33]. 

Say that for any formula E we have found a p+q - line truth table 
(condensed or not) with p t’s and q f’s. As illustrated, we can then write 
an equivalent formula D of the form Dj V . . . V D„ (which reduces to 

simply D, if p = 1, and say to Pj & — )P t if p — 0) where Dj D„ 

correspond to the respective t’s, and an equivalent formula C of the form 
C x & . . . & C q (which reduces to C x if q = 1, and say to P x V — )P X if^ = 0) 
where C lt . . . , C Q correspond to the respective f’s. To give another 
example, if E is P ~ Q, then D is (P & Q) V (“iP & ~iQ) and C is 
(-iP V Q) & (P V -iQ) [by Theorem 6 from — i{(P & — iQ) V (iP & Q)}]. 
Such a formula D (a disjunction of conjunctions of negated and unnegated 
atoms) equivalent to E is called a disjunctive normal form of E, and such 
a formula C (vice versa) a conjunctive normal form”. 

By a formula F being “simpler” than E we mean in a pragmatic sense 
that it is easier to comprehend and use, even if it is not shorter or much 
shorter. What is simpler will then depend on what uses we have in view, 
or on what sorts of formulas we have become adept at handling. (Thus 
we scarcely need to simplify P~Q.) Disjunctive normal forms of E are 
useful if we .are endeavoring to infer valid consequences from E (cf. 
Exercise 7.7 (b)) or from E and other formulas, and conjunctive normal* 
forms of E if we are endeavoring to infer E as a valid consequence of other 
formulas (cf. Exercise 7.2 (b)). 

Exercises. 8.1. Establish *52— *55 with P, Q (preparatory to sub- 
stituting A, B by Theorem l) by using (1) with *50, *51 (and Theorem 4). 

8.2. Simplify: 

(a) (P V S ~ — iQ) & (R V Q) & mS D — i((S n R V Q) V P). 

(b) m{(P D R) & S ~ “iR V ~i(R 3 Q)} & (Q 3 — iP) & R. (Write a simple 
equivalent after treating only R and Q.) 

(c) (P D Q) & (— i(Q & R) V P) & -i(R & P). (Treat Q first; and use *55b 

and *36+ *53 or *34+ *39.) <- 

20 In fact, P3(QVRD(RZ) — ]P)) reduces to — iP V — )R simply by using *59 thrice, 
*55a, *38 and *40 (with *32-*34). In general, the reduction of formulas can be per- 
formed by first using *63a (or Exercise 5.3) to eliminate ~ and *58 or *59 (or *55c, *60 
or *61) to eliminate 3, then Theorem 6, then "multiplying out" using *35 (or dually 
*36) as analogous to a(h -f c) = ab + ac in algebra, and finally consolidating and 
recombining as illustrated using *3 1 --*40, *5 2-*55. This technique goes back in essen- 
tials to Boole 1847. Cf. IM pp. 135-136. 
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§ 9. Proof theory: provability and deducibility. The proof of theo- 
rems, or the deduction of consequences of assumptions, in mathemat- 
ics typically proceeds a la Euclid, by putting sentences in a list called a 
“proof” or “deduction”. We use the word “proof” (and call the assump- 
tions “axioms”) when the assumptions have a permanent status for a 
theory under consideration, “deduction” when we are not thinking of them 
as permanent. Each step from some sentences in the list to another is 
mediated by logic, as analyzed above for the case the logic is propositional 
logic. Thus one sentence follows from others if it is a valid consequence 
of those others; this relationship we defined by truth tables in § 7. A sen- 
tence can be put into the list without reference to earlier sentences, if it 
is one of the assumptions or is valid. In the definitions of “valid conse- 
quence” and validity, we stood outside the language of the sentences 
themselves (the object language), and observed in another language (the 
observer’s language) how the sentences (or formulas) are composed from 
atoms. In the observer’s language, we al so develo ped various results^con- 
cernin g validity and the valid conse quence_ relationsh ip. which often arc 
mor e convenient to use than direct ar>Dlica tioiixtf. th£fruth.lables^ \Ve call 
this t reatment of logic “model theory ”, as in it we replace the atoms by 
truth values t and f in all possible combinations, to obtain what can be 
considered models or concrete replicas of what the sentences may 
express. 

Now we shall take up another way of founding logic. This treatment, 
called “p roof theo ry”, arises from ask ing the quest ion whether the proofs 
and deductions of logic itself cannot be given in an analogous way ^But 
J ndwTTihceTnsTogiC'flsHrt hat we would treat in the axiomatic-deductive 
i* manner, the inferences cannot be made by appealing to logical criteria but 
only to specifically stated axioms and rules. In proof theory, some sen- 
tences or formulas will be taken as logical “axioms”, and some “rules of 
inference” will be established for making the inferences from some sen- 
tences to another sentence. 

We now give such a formulation of the classical propositional calculus, 
both by itself and for its application to deduction from assumptions. 
Later we shall show that th e tw o formulat ions, that of model theory and 
that of proof theory, give equivalent results. 

As axioms for our (proof-theoretic) system of the (classical) proposi- 
tional calculus, we take all formulas of any of the forms shown after the 
symbol “t=” in la-lOb ofTheorem 2(and in the “List of Postulates” p. 387). 
These forms themselves we call axiom schemata. Each axiom schema 
includes infinitely many axioms, one for each choice of the formulas de- 
noted by “A”, “B”, “C”. For example, corresponding to la in Theorem 2, 
we have as Axiom Schema la: A 3 (B D A). Particular axioms by this 
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schema are : P 3 (POP),PO(Q 3 P),Q 3 (P 3Q),-iP 3(Q & R 3 -iP), 

(P 3 (-|R 3 P)) 3 [R 3 (P 3 (“iR 3 P))], etc. 

As the sole rule of inference, called the 3-r«/e or modus ponens or the 
rule of detachment, we take the operation of passing from two formulas 
of the respective forms A and A 3 B to the formula B, for any choice of 
formulas A and B (cf. Theorem 3). In an inference by this rule, the for- 
mulas A and A 3 B are the premises, and B is the conclusion. 

We define a ( formal ) proof (in the propositional calculus) to be a finite 
list of (occurrences of) formulas B,, . . . , B, each of which cither is an 
axiom of the propositional calculus or comes by the 3-rule from a pair 
of formulas preceding it in the list. A proof is said to be a proof o/'its last 
formula B,. If a proof of a given formula B exists, we say B is ( formally ) 
provable , or is a ( formal ) theorem, or in symbols P B. 

Example 4. For each formula A, the following list of five formulas 

B t B, is a proof of the formula A 3 A. (Here / = 5, and B x is 

A 3 (A 3 A), .... B 5 is A 3 A.) 27 

1. A 3 (A 3 A) — Axiom Schema la. 

2. (A 3 (A 3 A)} 3 {[A 3 ((A 3 A) 3 A)] 3 [A 3 A]} - Ax. Sch. lb.E> F 

3. [A 3 ((A 3 A) 3 A)] 3 [A 3 A] — modus ponens, 1, 2. 

4. A 3 ((A 3 A) 3 A) — Axiom Schema la. 

5. A 3 A — modus ponens, 4, 3. 

Besides the proof itself B„ . . . , B„ we give at the left numbers for refer- 
ence, and at the right reasons which justify the inclusion of each of the 
formulas B,, . . . , B, in the proof (an “analysis” of the proof). Thus at 
Step 1, we have applied Axiom Schema la with A both as the A and as 
the B of the schema (i.e. with the formula presently denoted by “A” as 
the formula denoted by “A” and by “B” in the statement of the schema). 

In Step 2, A is the A, A 3 A is the B, and A is the C, of Axiom Schema lb. 

In applying modus ponens to Formulas 1 and 2 at Step 3, the A of the rule 
is A 3 (A 3 A) (which is 1) and the B of the rule is 3. — Because 1-5 
constitutes a proof (for any fixed formula A), we can say that A 3 A is 
provable, or in symbols (- A 3 A. Similarly, f A 3 (A 3 A), etc. (Why?) 

A few remarks may help to make this way of formulating the proposi- 
tional calculus as an axio matic-dedu ctive theory seem reasonable. We have 
an infinite number of axioms from each axiom schema, as mentioned 
above. This could be avoided by requiring the language in which the prime 
formulas arc constructed to include single letters as “proposition variables” 
and adding a second rule of inference, the “substitution rule”, to say that 
E* can be inferred from E under the circumstances of Theorem 1 when 


27 What we actually exhibit is a “proof schema”, which becomes a particular proof 
for each choice of the formula denoted by “A”. 
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Pj, . . . , P„ are proposition variables. That procedure seems less in keeping 
with usual mathematical language than the one we have followed (due to 
von Neumann 1927). 28 

The list of thirteen axiom schemata may seem surprisingly long. How- 
ever, each propositional connective must have axioms to characterize it, 
i.e. to provide the deductive properties we want it to have. We are getting 
along with only two or three axiom schemata for each of 3, &, V, "i, 
namely, one or two (left column in Theorem 2) which help us to prove 
formulas in which the symbol is used (i.e. to “introduce” the symbol), 
and (except for ID) one or two (right column) which help us to infer 
formulas not containing the symbol (or not containing it so often) from 
formulas containing it (i.e. to “eliminate” the symbol). In the case of 3, 
the 3-rule provides for “elimination”. This (to our mind) elegant arrange- 
ment of axiom schemata is due essentially to Gentzen 1934-5. 

We could get along with fewer axiom schemata by foregoing the use of 
some of the symbols &, V, ' — ■ as an official part of the object language. 
For example, if each time we wrote “A ~ B” we understood it as an 
abbreviation for (A 3 B) & (B 3 A), then Axiom Schemata 9a, 10a, 10b 
could be omitted. 29 

For the propositional calculus applied to infer formulas from assump- 
tions A x A m , the formulas A l5 . . . , A„, are in effect allowed to func- 

tion as axioms also. However, we shall not call them “axioms” (for the 
propositional calculus), but (when we need a name) assumption formulas', 
and (for m > 0) we shall not call Bj, . . . , B ( a “proof”, but a “deduction” 
from Ad ... , A m . That is, a finite list of (occurrences of) formulas Bj, . . . , 
B, is a ( formal ) deduction (of B,) from A t , . . . , A m (in, or by, the proposi- 
tional calculus), if each formula in the list is one of A x , . . . , A„„ or one 
of the axioms (of the propositional calculus, i.e. by one of Axiom Schemata 
la-lOb), or comes from two earlier formulas in the list by the 3-rule. If 
there is a deduction of a given formula B from A t , . . . , A m , we say that 
B is de ducihle frp m A 1; . . . , A m , or in symbols A lt . . . , A,„ b B. The 
symbol “I-” may be read “yield(s)”. (In using this terminology when 

28 In either case, the rule of inference must have the character of a schema, with the 
Roman capital letters “A” and *‘B” standing for any formulas, in order to provide for 
infinitely many different applications of it. Our axiom schemata can be considered as 
rules of inference with zero premises. Hence Carnap 1934 called the axiom schemata 
and the rules of inference together "transformation rules”. The rules defining the class 
of formulas (§ 1), analogous to the rules of syntax in grammar, are the "formation 
rules”. 

28 This is done in IM. If we took “A & B”, “A V B”, "A ~ B” to be abbreviations 
for — 1 ( A 3 — iB), — ]A 3 B, — )(( A 3 B) 3 — 1 ( B 3 A)), respectively, we would need 
only the four axiom schemata la, lb, 7, 8. There are still other possibilities. Cf. Church 
1956 pp. 119, 136-138. 
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m > 0, “deduction” and “deducible” include “proof” and “provable” as 
the case m — 0.) :i " 

Lxami’u. 5. For each choice of formulas A, B, C, the following 
sequence of 8 formulas is a deduction of C from A^D (B 23 C), A & B. 

1. A & B 2nd assumption formula. 

2. A & B 23 A ~ Axiom Schema 4a. 

3. A - modus ponens, 1, 2. 

4. A3|B3C) - 1st assumption formula. 

5. B 2) C — modus ponens, 3, 4. 

6. A4113 B - Axiom Schema 4b. 

7. B -- modus ponens, 1, 6. 

8. C — modus ponens, 7, 5. 

Thus we can say that C is deducible from A 23 (B 23 C), A & B; or briefly, 
A D (B => C), A & B I- C. 

We said “formal proof” and “formal deduction” in the definitions above 
(though we shall usually omit the word “formal”) to emphasize that these 
proofs and deductions are in the object language, which we are studying 
in the observer’s language (§ 1). From the standpoint of the observer's 
language, we look only at the form of the formulas (in contrast to their 
meaning or content) in determining under the definitions just given whether 
a given sequence of formulas B„ . . . , B, is in fact a (formal) proof, or a 
(formal) deduction from given assumption formulas A,, . . . , A m . A 
sequence of formulas B„ . . . , B ( is a (formal) proof, or a (formal) deduc- 
tion from A,, . . . , A„„ only when it exactly fits the above definition (as 
illustrated in Examples 4 and 5.) 31 This stereotyping of the operations that 
can be performed in constructing a formal proof or deduction makes the 
formal proofs and deductions (in the object language) definite enough in 
structure to serve as objects of our study. 

In our study (in the observer’s language), we •shall also be proving 
theorems, deducing consequences of assumptions, etc. This will be as much 
the case here in proof theory as it was in model theory. In these i nform al 
proofs and deductions, we may operate flexibly, on the basis of the 

30 'The symbol “ 1 ” goes back to Frege 1879; the present use of it to Rosser 1935 and 
Kleene 1934. (Rosser proposed it to express deducibility by the rule(s) of inference, and 
Kleene suggested including also use of the axioms.) The parallel use of “|=” (§§ 2, 7) is 
perhaps original with Kleene 1956a. 

31 In particular, we do not simply allow a formula to be included just because it is 
valid, or to be inferred from preceding formulas just because it is a valid consequence of 
them. If we did so, no proof would need to be longer than one formula, and no deduction 
from A,, . . . , A,„ would need to be longer than m + 1 formulas, but the one inference 
might be enormously complicated. There is an aim in proof theory to analyze the 
inferences into simple ones, as psychologically they must be in our actual reasoning. 
(More on this subject later, pp. 70-71). 
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meanings of our statements, using any inferences that carry conviction. 
(It may indeed be that some of these inferences are the informal counter- 
parts of operations available in the formal proofs, but they are not re- 
stricted to be such.) 32 

The student who keeps in mind that there are the two languages, where 
now (formal) proofs and deductions as well as formulas in the one lan- 
guage (the object language) are being studied, using (of necessity) informal 
proofs and deductions in another language (the observer’s language), 
should have no trouble keeping matters straight. In model theor y, the 
object language was dealt with only as an assemblage oiformulas, whose 
truth tables we investigated ; so there was not quite as much of a parallelism 
of terminology as we shall now have. 

We conclude this section by giving in the observer’s language two easy 
(informal) theorems (and their proofs), which have as their subject or 
object formal proofs and deductions in the object language. 

Theorem 9. 

(i) For m ^ 1 : (ii) For m,p > 0: 

Aj, . . . , A m b An If Aj, . . . , A m b Bj, 

• • • J • ■ • » 

An ... , A m h A m . . Aj, . . . , A m b B c and B„ . . . , B, b C, 
then Aj, ... , A,„ b C. 

Proof, (i) The definition of a deduction B 1( . . . , B, does not require 
that each of the assumption formulas An .... A m actually occur in the 
list Bn ... , Bj. So, for each / from 1 to m. A, standing by itself constitutes 
a deduction of A { from Aj, . . . , A m . (ii) In a given deduction of C from 
B x , . . . , B„, we can replace the occurrences of the assumption formulas 
B x , ... , B p by deductions of B„ . . . , B„ respectively from Aj, .... A m . 
Thereby we obtain a deduction of C from Aj, .... A m . — 

Let An . . . , A,„ be a given list of formulas. Suppose we are exploring 
the class of the formulas B which are deducible from Aj, .... A m . 
Theorem 9 (i) tells us that A 1( • . . , A,„ themselves are in this class. 
Theorem 9 (ii) tells us that any formula C is in this class which is deducible 
from any formulas Bj, . . . , B fl already known to be in this class. 

From this standpoint, the role of Theorem 9 should be clear. However, 
we shall return to this in § 13, after obtaining some practice meanwhile 
with particular applications of Theorem 9. — 

Before continuing, it may be well to contrast the meanings of four 
expressions. 

32 This is not to say that our informal proofs and deductions in the observer’s lan- 
guage need not conform to any logical standards. But we are not now trying to regulate 
them or to study them as specimens of logic. 
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“(= A 3 l>" means that the formula A 3 B is valid, i.e. its truth table 
has a solid column of t’s. 

“A b B” means that the formula B is a valid consequence of the formula 
A, i.e. B has t in each of those lines of its truth table in which A has t. 

‘TAD B” means that the formula A ID B is provable, i.e. there is a 
finite sequence of formulas such that each formula of the sequence either 
is an axiom or comes from two preceding formulas of the sequence by 
modus ponens, and the last formula of the sequence is ADB. 

“A h B” means that the formula B is deduciblefrom the formula A, i.e. 
there is a finite sequence of formulas such that each formula of the sequence 
either is A or is an axiom or comes from two preceding formulas of the 
sequence by modus ponens, and the last formula of the sequence is B. 

By the end of § 12, we shall have found that these four expressions are 
equivalent, i.e. if any one is true so are the other three. (For the first two, 
we have this already in Theorem 8.) 

Likewise, we shall find that “If 1= A, then h B” and “If b A, then b B” 
are equivalent. These two expressions are weaker than the preceding four, 
as we have already seen in § 7 for the ones with “N”. 

Thiorum 10. (a) If I- A 3 B, then A b B. (b) More generally, for 
any in > 1 : //A,, .... A w ^ b A m 3 B, then A,, v . . , A,,,^, A,„ b B. 

Prool. (b) By hypothesis, there is a deduction of A„, 3 B from 

A, A,„_! (say it has k formulas). Using this, we can construct a 

deduction of B from Aj, . . . , A m _ 1( A m as follows. 

I the deduction of A,„ 3 B from A 1( . . . , A m _j 
given by the hypothesis of the theorem. 

A+l. A,„ — /nth assumption formula. 
k + 2. B — modus ponens, A'+l, k. 

Corollary. If b 3 ( . . . (A m _, 3 (A,„ 3 B)) . . .), then 
A x A„, ,.A„, b B. 

Exlrcisi-s. 9.1. Add to 1-5 in Example 4 to make a proof of A ~ A. 
9.2. The following is a deduction of C from A, B, A3(B3C). 
Supply the reasons (or “analysis”), and state the result using “b”. 

1. A. 

2. A 3 (B 3 C). 

3. B 3 C. 

4. B. 

5. C. 
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9.3. By constructing appropriate deductions, show that: 

(a) A, A 3 B b B. (b) A, B b A & B. (c) A & B I- A. (d) A & B b B. 

(e) A b A V B. (f) B h A V B. (g) n-iA b A. 

(h) A 3 B, B 3 A b A ~ B. (i) A ~ B b A 3 B. (j) A ~ B b B 3 A. 

9.4. Supply missing hypotheses (and justify them) or conclusion in the 

following applications of Theorem 9 (ii): 

(a) A, — lA I — i — i B and — i — iB b B; therefore (m = 2 , p = 1.) 

(b) and A & B b A; therefore A 3 (B 3 C), A & B b A. (m = 2, 

p = 1.) 

(c) A 3 (B 3 C), A & B b A and A 3 (B 3 C), A & B b B and and 

A, B, A3(BDC)bC; therefore A3(BDC), A&BbC. (m = 2, 

P = 2.) 

(d) and and A, A 3 B b B; therefore A, ~ iB, A3B b B. 

(m = 3, p = 2). 

9.5. Corresponding to Exercise 9.4(c), illustrate the proof of Theorem 
9 (ii) by combining the deductions of Exercise 9.3 (c) and (d) (construed as 
deductions from A 3 (B 3 C), A & B) with that of Exercise 9.2. Compare 
your result with Example 5. 

9.6. Write out the m = 0 and p = 0 cases of Theorem 9 (ii) and adapt 
the proof to them. 

9.7. Show how the result of Exercise 9.1 (i.e. bA~A) comes by 
Theorem 9 from bADA. (Example 4) and Exercise 9.3 (h). (Take 
m = 0.) 

9.8. Show that: If b A x & . . . & A,„ 3 B, then A x , . . . , A„, b B. 20 

§ 10. Proof theory: the deduction theorem. The property of 
deducibility expressed by the next theorem corresponds to a familiar 
method in our informal reasoning. To establish an implication “If A, 
then B”, we often assume A “for the sake of the argument” and undertake 
to deduce B. This method is also available in the presence of other 
assumptions A u . . . , A m _ x . 

The proof is longer than that of the preceding theorems (except 
Theorem 2, if all the computations are included). But it has a simple plan, 
after which the rest of the work falls into four simple cases. 

Thbokem 11. (The deduction theorem, Herbrand 1930.) 33 (a) If 
A b B, then b A D B. (bf^An^f. , A m _ x , A m b B, then A,,..., 
A„,-i b A,„ 3 B. 

31 The deduction theorem as an informal theorem proved about particular systems 
like the propositional calculus and the predicate calculus (Chapter II) first appears 
explicitly in Herbrand 1930 (and without proof in Herbrand 1928); and as a general 
methodological principle for axiomatic-deductive systems in Tarski 1930. According to 
Tarski 1956 footnote to p, 32, it was known and applied by Tarski since 1921. 
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Prooi . (b) The student should first review exactly what the hypothesis 
and conclusion mean under our definition of “f”. (Each asserts the 
existence of a certain kind of a finite list of formulas, say Bp . . . , B, and 
Lip . . . , B',. There are two differences in the specifications for these two 
lists. In the first list but not in the second, a formula may be inserted on 
the ground that it is the with assumption formula A,„. In the first list the 
last formula must be B, in the second A„, 3 B.) 

We must show that, whenever we are given a deduction of B from 
A,, . . . , A,„ i, A,„ (the “given deduction”), we can find a deduction of 
A m 3 B from A,, . . . , A,„_p There is in fact a uniform method by which, 
from any such given deduction, we can always find a deduction of A m 3 B 

from A, A„, , (the “resulting deduction”). We describe this method 

in general terms now, and in Example 6 below we illustrate it. 

Say the given deduction is 

(a) Bp ... , B„ 

so B, is B. (In Example 6, (a) is the left column, consisting of Formulas 
1-5.) As the first step toward constructing the resulting deduction, we 
prefix to each formula of the given deduction (a) the symbols A„, 3, 
supplying parentheses as appropriate. Thus we obtain 

(P) A,„ 3 Bp ... , A m 3 B„ 

which docs have hist the formula A m 3 B which must be last in the 
resulting deduction. (In Example 6, ((3) is 3', 8', 11', 14', 17' in the right 
column.) But this sequence ((3) is not in general a deduction from 
A,, . . . , A,„_ t . However we can insert some additional formulas into it 
before each one A m 3 B, of its formulas (i = I, ... . , /) so that it will 
become a deduction (y) from A 1; .... A„,_j. For each i, the choice of the 
formulas to be inserted before A„, 3 B, depends on the reason given for 
the inclusion of B s in the given deduction (a). 34 

Case 1 : B, is one of the first m— 1 assumption formulas Ap . . . , A m _p 

which are retained as assumption formulas for the resulting deduction; 
say B, is A ,■ ( j < m). Then we insert the first two of the following formulas 
before the third, which is A m 3 B ( . 

k'. A, — /th assumption formula. 

A-+T. A, 3 (A„, 3 A ; ) — Axiom Schema la. 
k + 2'. A,„ 3 A, — modus ponens, k', k+V. 

34 Thus the uniform method applies to the given deduction B 1 , . . . , B, as a sequence 
of formulas, together with a reason for the inclusion of each fo/mula in it. These reasons 
we call an analysis of a deduction. More than one analysis may be possible; e.g. in 
Example 6 (where A, B, C are understood to be any formulas, not necessarily distinct), 
if C is the same formula as A, another analysis would justify Formula 3 by Axiom 
Schema la. 
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(In Example 6, this is illustrated by l'-3' with k' = T, and again by 
9'-ll' with *' = 9'.) 

Case 2: is the last assumption formula A m , which will not be 

retained as an assumption formula for the resulting deduction (unless A m 
happens to be the same formula as one of A t , . . . , A m _i). We insert the 
first four of the formulas in the proof of A 3 A in Example 4 for A m as 
the A. (In Example 6, this is illustrated by 4'-8', A m being A so the 
insertions read exactly as in Example 4.) 

Case 3: B, is an axiom. Treated similarly to Case 1. (Not illustrated 

in Example 6.) 

Case 4: B s comes from two preceding formulas B, and B A (g, h < i ) 

by modus ponens. We leave it to the reader as an exercise to work out the 
treatment of this case, and to supply the insertions 12', 13' and 15', 16' in 
Example 6 (Exercise 10.1). 

Example 6. To illustrate the proof of the deduction theorem, we give: 
(a) in the left column, a deduction of C from A 13 (B 3 C), B, A; and 
(y) in the right column, the deduction of ADC from A 3 (B 3 C), B 
resulting from that given deduction by the uniform method described in 
the proof of the theorem. 

I'. B — 2nd assumption formula. 

2'. B 3 (A 3 B) — Axiom Schema la. 

1. B — 2nd ass’n. formula. 3'. A 3 B — modus ponens, T, 2'. 

4'. A 3 (A 3 A) — Axiom Schema la. 

5'. (A 3 (A 3 A)) 3 

{[A 3 ((A 3 A) 3 A)] 3 (A 3 A)} 

— Axiom Schema lb. 

6'. [A 3 ((A 3 A) 3 A)] 3 [A 3 A] 

— modus ponens, 4', 5'. 

7'. A 3 ((A 3 A) 3 A) - Ax. Sch. la. 

2. A — 3rd. ass’n formula. 8'. A 3 A — modus ponens, 7', 6'. 

9'. A 3 (B 3 C) — 1st assumption 
formula. 

10'. {A 3 (B 3 C)} 3 

(A 3 (A 3 (B 3 C))} - Axiom 
Schema la. 

3. A3(B3C)- 1st. a. f. 1 1'. A 3 (A 3 (B 3 C)) - m.p., 9', 10'. 

12 '. 

13'. P V\ ~ * C ct + J-I r 1- ' 

4. B 3 C — m. p., 2, 3. 14'. A 3 (B 3 C) - m. p„ IT, 13'. 

15'. 

16'. 

5. C — modus ponens, 1, 4. 17'. A 3 C — modus ponens, 14', 16'. 
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(A simplification applicable to this particular example leads to a 
shorter deduction of A 3 C from A3(B3 C), B. Indeed, the 7 formulas 
1', 2', 3', 9', 15', 16', 17' suffice in place of the 17 which our uniform 
method gives.) 

Applying the uniform method to 1 '— 1 7' as the given deduction leads to 
a deduction 1 "—53" of IO(A3C) from AD(B3C); and application 
of the method to this in turn leads to a proof 1 J 6 1 of 
(A3(IDO)3(B3(A3C)). (Cf. Exercise 10.2.) 

We wrote out the 17-formula deduction of A 3 C from A3(B3 C), 
B resulting by our method applied to 1-5 in order to help the reader 
visualize the proof of the deduction theorem. But now that the deduction 
theorem is proved, we shall be satisfied to use it to infer the existence of 
deductions and proofs without actually constructing them. Thus actual 
construction of the deduction 1-5 (left column) establishes that 
A3(B3C), B,A I- C. Thence it follows by three successive applications 
of Theorem 1 1 that A 3 (B 3 C), B b A 3 C, that 
A3(B3C) h B 3 (A 33 C), and finally that 

t(A3(H3C))3(B3(A3C)), i.e. that there exists a proof of the 
formula (A3(B3C))3(B3(A3C)). This satisfies us; we have no 
interest in actually seeing a proof of that formufa, least of all the 161- 
formula proof resulting by three successive applications of our unilorm 
method to the 5-formula deduction in the left column of Example 6. 
(There are shorter proofs of (A 3 (B 13 C)) 3(B3(A3 C)). Using the 
simplification of 1 '— 17' noted above before reapplying the uniform method, 
we get one having 7 1 formulas.) But while our uniform method in the proof 
of Theorem 1 1 may be uneconomical for constructing deductions and 
proofs, it is efficient for proving Theorem 1 1 ; and Theorem 1 1 itself is 
very efficient for establishing their existence. We believe the reader right 
after Example 5 would have found it a fairly difficult exercise to construct 
(or show the existence of) a proof of (A 3 (B 3 C)) 3 (B 3 (A 3 C)). 

Changing the letters in the preceding illustration (which we can do 
since A. B, C were arbitrary formulas, e.g. the'B, A. C of the next), 
h(B3(A3C))3(A3(B3C)). Now using Axiom Schema 9a and 
modus ponens twice, E A 3 (B 3 C) ■ — - B 3 (A 3 C) (cf. *3). 

Lxampli: 7. Similarly, applying the deduction theorem to the result 
of Example 5, A 3 (B 3 C) E A & B 3 C and 
E (A 3 (B 3 C))3 (A &B3C). 

Corollary. // A,, . . . , A,,, .,, A,„ E B, then 
EA 1 3(...(A W ‘, I 3 (A m 3 B ))...). 

Exlrcisi s. 10. 1. Treat Case 4, and supply 12', 13', 15', 16' in the right 
column of L .sample 6. 

10.2. Show that, when the given deduction has I formulas, the resulting 
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deduction has 3/ + 2 formulas if A w is used as such in it; otherwise, 3/ 
formulas. 

10.3. Show that A&B3CbA3(B3C) (start by constructing an 
appropriate deduction) and (using Example 7) 

I- A =3 (B 3 C) ~ A & B 3 C (cf. *4a). 

10.4. Show that h (A 3 B) 3 ((B 3 C) 3 (A =3 C» (cf. *2). 

10.5. Show that: //A,, .... A,„ f B, then h A, & ... & A,„ 3 B. 

§ 11. Proof theory: consistency, introduction and elimination 
rules. The corollaries to Theorems 10 and 1 1 accomplish the reduction 
of the deducibility notion “A lt . . . , A,„ h B” to the provability notion 
“f E” in a manner parallel to the reduction of the notion of valid con- 
sequence “A,, .... A,„ f B” to the notion of validity N E given in 
Corollary Theorem 8. 

He nce, if we can show that “b E” and “f E” are equ ivalent, w e sha ll 
then ha ys completed the d emonstration o f the equivalence of proo f 
theor y and model theo rv^for the P E&DQsitioniU . calculus, both as used in 
treating absolute logical truths and when applied under assumptions 
A„ . . . , A,,,. We do this in Theorems 12 and 14. 

St— «=" 

Jj Theorem 12. Each provable formula E is valid,' using our symbols. 
// if V E, then 1= E. 

Proof. By la- 10b in Theorem 2, each axiom of the propositional 
calculus is valid. By Theorem 3, given that the premises Aand A 3 B for an 
application of modus ponens are valid, so is the conclusion B. Thus, as we 
construct a proof Bj, . . . , B ( of E, each of the formulas B t , B 2 , B 3 , . . . 
successively introduced (either as an axiom, or as a consequence by modus 
ponens) is valid. Therefore, the last formula B ; , which is E, is valid. 

Corollary. For no formula B are both B and ~iB provable; using our 
symbols: for no formula B do both H B and I iB hold. 

Proof. Suppose I- B and b~iB for some B. Then by the theorem, 
f B and t= — iB; i.e. B has all t’s in its truth table and so does iB. This is 
absurd, since, by the table for ~i, if B has all t’s then ”iB has all f’s. 

In general, by a "consistency property” of an axiomatic-deductive 
system (to be called a “formal system” in § 37) we mean a property that 
at most certain formulas are provable (e.g. only ones having some desired 
property, or lacking some undesired property). By a "c omp leteness 
p roperty ”, we mean a property that at least certain formulas are pTuvrfble 
(e!g71mhaving a certain desired property). 

Thus Theorem 12 establishes "consistency of the propositional calculus 
! /with respect to validity”, and its corollary establishes the so-called simple 
/ (consistency”. 
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To establish the converse of Theorem 12 (Theorem 14, giving the 
“completeness of the propositional calculus with respect to validity”), 
we shall need to develop the proof theory of the propositional calculus 
for a certain distance. In this development, the deduction theorem 
(Theorem 11) is a most helpful tool. We begin (in Theorem 13) with a 
collection of fourteen rules adapted from Gentzen 1934-5, which we call 
“introductions” and “eliminations” of logical symbols. For completeness, 
we include the deduction theorem itself as “D-introduction”, and modus 
ponens restated using “ f” as ‘O-elimination”. The rest of the rules (with 
one exception, “weak -i-elimination”) amount essentially to restatements 
of the axiom schemata in the light of these two rules. To save space, we 
let “F” stand for a list of zero or more formulas, so we can write ‘T, 
A 1- B” for “A,, . . . , A,,,..,, A,„ b B” with A m = A (r empty for m = 1). 

The rule of -i-introduction (next to the bottom in the left column) 
corresponds to the informal method of “reductio ad absurdum”: to 
prove “not A” or that A is false, we assume A “for the sake of the 
argument” and deduce a contradiction B and “not B”. This argument can 
be carried out in the presence of prior assumptions I\ 

The rule of V-elimination corresponds to the informal method of 
“proof by cases”. If we have established “A or B” or are assuming this, 
then to show that C follows it suffices to show it in two cases, the case that 
A holds and the case that B holds. (For the problem of deducing C from 
A V B, we can thus “eliminate” the V, and attempt the deduction from A 
and from B separately. It is in this sense that we can consider the rule as 
an elimination rule.) 


Theorem 13. for any finite list of ( zero or more ) formulas T, and any 
formulas A, B, C: 

Introduction Elimination 


D If 1', A b B, 

then I 1 b A D B. 

& A, B b A & B. 

V A b A V B. 

B b AV B. 

-> // r, A b B and V, A b -iB, 

then T I — tA. 

(Reductio ad absurdum.) 


A,ADB bB. 

A & B b A. 

A & B b B. 

7/r, A bC and T, B b C, 
then I\ A V B b C. 

(Proof by cases.) 

— i“iA b A * 

((Double) negation elimination.) 0 
A, ~)A b B. 

(Weak negation elimination.) 
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~ A3B,BDA1A~B. A~B 1ADB. 

A~ B h B 3 A. 

Proofs. We already have 3-introduction in Theorem 11, and 
3-elirn., &-introd., &-elim., V-introd., (double) “i-elim. and the three 
'—'-rules in Exercise 9.3. 

V-elimination (proof by cases). 

1. ,r, A 1C - hypothesis. 

2. P, BBC - hypothesis. 

3. r h A 3 C — 3-introd. (the deduction theorem), 1. 

4. r h B 3 C — 3-introd. (the deduction theorem), 2. 

5. A 3 C, B 3 C, AVB f C — using Axiom Schema 6, and modus 
ponens thrice. (That is, we can construct the following deduction: 
1'. A 3 C. 2'. (A 3 C) 3 ((B 3 C) 3 (A V B 3 C)). 

3'. (B 3 C) 3 (A V B 3 C). 4'. B 3 C. 5'. A V B 3 C. 6'. A V B. 

7'. C. The student should supply the reasons.) 

6. F, AVB IC - combining 3, 4, 5 by Theorem 9. (For, 

T, AVB f A 3 C [using 3; cf. Exercise 1 1 . 1], 

T, AVBI-B3C [using 4], 

l 1 , A V B ) A V B [Th. 9 (i)] and A 3 C, B 3 C, A V B F C [by 5]; 

so T, A V B 1 C [by Theorem 9 (ii) with m = 2, p — 3.]) 

Weak ~ i-elimination. 

1. A, -iA, iB h A — (Theorem 9 (i)). 

2. A, “iA, ~iB f — iA — (Theorem 9 (i)). 

3. A, ~ iA 1 — l — iB f B — ~ i-introd., I, 2; ~ i-elim. 35 

By this rule, from a contradiction A, “ iA, any formula B can be deduced. 
The idea of the proof of it just given is to deduce a contradiction from A, 
~ iA, ~iB and blame the contradiction on ~ iB. 

Exercises. 11.1. Infer ‘T, AVB I- A 3 C” from 3 in the proof of 
V-elimination by two methods: directly from the definition of “deduction”; 
by use of Theorem 9 (as in Exercise 9.4 (b) and (d)). 

11.2. Prove the rule of ~ l-introduction (reductio ad absurbum). 

§ 12. Proof theory: completeness. We shall prove the complete- 
ness of the propositional calculus by a method due to Kalmar 1934-5. 
In preparation, we first establish four lemmas. 

Lemma 1. To each entry (or line) in each of the five basic truth tables 
for the propositional calculus in §2 , a corresponding- deducibility relationship 

35 “F 1 A, F A 2 ” is an abbreviation for “P FA, and A, 1 Aj”, from which 
“P f A 2 ” follows by Theorem 9 (ii). Cf. Exercise 9.4 (a). Similarly with longer chains. 
(We use only one formula after each “ F”.) 
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holds. For example, the tables for 3 and — i follow at the left, the corre- 
sponding deducibility relationships at the right. 


A B A 3 B 


t t 

t 

A, B F A 3 B 

(1) 

t i 

f 

A, — iB F — i(A 3 B) 

(2) 

T t 

t 

mA, B F A 3 B 

( 3 ) 

f f 

t 

- iA, ~iB F A 3 B 

( 4 ) 

A 

-iA 



t 

f 

A F -T-i iA 

( 5 ) 

f 

t 

-iA F -iA 

(6) 


(Altogether, 18 deducibility relationships are asserted by this lemma.) 

Proois. For illustration we establish three of the four relationships 
above for A 3 B. 

(1) 1. A, II. A Mi - (Theorem 9 (i)). 

2. A, B F A 3 B — 3-introd. (the deduction theorem), 1. 

(2) I. A. -iB, ADI1 F B — 3-elint. (and Th. 9; cf. Ex. 9.4 (d)). 

2. A, nli, A D B F -iB — (Theorem 9 (i)). 

3. A, -iB F ~i(A 3 B) — “i-introd., 1, 2. 

(4) 1. ~iA, ~iB, A F B — weak — i-elim. (and Theorem 9). 

2. ~iA, — iB F A3B — 3-introd. (the deduction theorem), 1, 

Lf.mma 2. Consider the truth table for any formula E containing (at 
most) the atoms P lf . . . , P„. To each of the 2" entries (or lines) in this 
table . a corresponding deducibility relationship holds. For example, let E be 
P 3 (Q V R 3 (R 3 — «P))- Corresponding to the f in Line 3 of its 
truth table in t; 2, the lemma asserts that 

P, -iQ, R F -i{P 3 (Q V R 3 (R 3 — iP))}. 

Proof. Wc explain the method using the illustration. Corresponding 
to the computation step from f, t for Q, R to t for Q V R, Lemma 1 gives 
- iQ. R F Q V R, whence obviously (or by Theorem 9): 

1. P. ~iQ, R F Q V R. 

Corresponding to the computation step from t for P to f for ~iP, (5) in 
Lemma 1 gives P I — i — iP, whence: 

2. P, — iQ, R I — i — iP. 

Corresponding to the computation step from t, f for R, — iP to f for 
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R D — iP, (2) in Lemma 1 (with R, nP as the A, B) gives R, mP P 
-i(R D — tP); combining this with 2 by Theorem 9: 

3. P, -iQ, R f- -i(R 3 -iP). 

Continuing in this manner, we obtain successively P, ~iQ, R P D or 
P, — iQ, R P “)D for each formula part D of E, according as that part 
receives the value t or f in the computation for the assignment of t, f, t to 
P, Q, R. At the end, since the whole E receives the value f, we thus have: 
5. P, -iQ, R P -i{P D(QVR3(R3 -iP))}. 

This completes our illustration of the proof of Lemma 2. 36 

It may be instructive to view in a diagram with two “trees” how each 
computation step (horizontal line in the left tree) corresponds to a 
deducibility relationship of Lemma 1 (horizontal line in the right tree). 
The left tree is the computation given in § 2, reproduced omitting repeti- 
tions in the writing of values. By Theorem 9, it follows that in the right 
tree each formula is deducible from the distinct formulas occurring at the 
tops of branches over it (or any larger set of formulas). 
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-|Q R R — i-iP 

Q V R — i(R D -iP) 

P -i( Q V R D (R D -iP)) 
— i{P D ( QVRD (RD -iP))} 


Lemma 3. If the formula E in Lemma 2 is valid (i.e. N E), then 
Pi V ~ iP,, . . . , P„ V ~ iP„ PE. 

Proof. E.g. taken = 2. Then by Lemma 2 with the present hypothesis: 

Pi, P 2 V E. 

Pi, IPs 1- E. 
iP., Pa I" E. 

-iP„ iP 2 P E. 

By two applications of V-elimination: 

P„ P 2 V~iP 2 P E. 

— iPi, P 2 V-1P 2 P E. 

3 * of course, the verification of one case of a general theorem (or lemma) does not 
prove the theorem (or lemma). The proof consists in the fact that the method used in 
the illustration is general, i.e. applies to all cases. The illustration illustrates a pattern of 
treatment, applicable to ail cases. When this fact should be obvious, we may omit 
stating the proof in general terms, as we do now. 
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By a third application of V-elimination: 

PjV-iP,, P,V -iP., h E. 

Lemma 4°. For each formula A : h A V -»A. (The law of the excluded 
middle; cf. *51 in Theorem 2.) 

We leave the proof as an exercise (Exercise 12. 2). 37 

Theorem 14°. Each valid formula E is provable; using our symbols: 
if f E, then h E. 

Prooi'. By Lemma 3 with Lemma 4 and Theorem 9 (ii) for m = 0. 

This comp letes the proof of the equivalence of proof the ory and m odel 
th eory ^ for the propositiona l calculus. Some of the results obtained in 
model theory we had to develop independently in proof theory before 
establishing the equivalence. However, we can now take over with “ h” 
replacing “k” any result we established in our model-theoretic treatment 
of the propositional calculus (§§ 2-8). 38 For example, all the results in 
Theorem 2 hold with “h” in place of “k”; before we had Theorem 14, we 
had this explicitly only for la-lOb (because we took them as axiom 
schemata), *1 (by Example 4 in § 9), *2, *3, *4a (end § 10), *19 (Exercise 
9.1) and *51 (Lemma 4). 

Natural as the development of the propositional calculus by truth 
tables (model theory) scents now, it was actually the more recent approach 
to be fully exploited, by Post, who first proved Theorems 12 and 14 in 
1921. and by Lukasiewicz in 1921, although some of the development goes 
back to Frege 1879 and Peirce 1885. Although an algebra of logic was 
initiated by Boole 1847 and De Morgan 1847, t he proof theory of th e 
propositio nal calculus properly appeared with Frege’s “BegriffsscHnft*’ in 
1879, and_i n Russ ell’s work, especially in the “Principia Mathematica” of 
Whitehead and Russell I910-13. 39 

Other propositional calculi. As illustration of our remark in 
§ 2 that there are different systems of logic, we mention that Lukasiewicz 

37 Although a proof can be given at this stage in eight lines (cf. IM p. 120 *51), it is a 
bit tricky to find. Later (end § 13) it will be easy. We refrain from giving it here in order 
not to spoil the fun the student may have in trying it himself. (A proof is also implicit 
in our demonstration of *51 0= A V — jA) by duality, following Theorem 7 in § 6; the 
ingredients of that proof are easily developed in proof theory.) 

38 These replacements in simple contexts constitute applications in the observer's 
language of the replacement rule which as applied to the object language we established 
in Corollary Theorem 5. For now we have established (with m > 0) “Aj, . . . , A,„ k B if 
and only if A,. ... , A,„ f B”, and we can use this in the role of the A ~ B of Corollary 
Theorem 5. 

39 See Church 1956 pp. 155 ff. 
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in 1920 introduced a 3-valued propositional calculus, in which the model 
theory is given using three truth values instead of just the two t and f. 
Post in 1921 (independently of Lukasiewicz) generalized from the classical 
(= 2-valued) propositional calculus to n-valued propositional calculus for 
each positive integer n > 2. To what extent n-valued logics for n > 2 are 
a tour de force is moot. 40 

Modal propositional calculi de al with such notions as “A is necessary” 
(in symbols, DA) and “A is possible” (in symbols, QA or equivalently 
— iQ— iA). These notions enter in domains of thinking where there are 
understood to be two different kinds of “truth”, one more universal 
or compelling than the other. For example, it is impossible that 2+2 = 5 
(it is contrary to mathematical laws); but it is possible that there is a large 
continent in the middle of the Pacific ocean (it is contrary only to the 
geographical facts). A zoologist might declare that it is impossible that 
salamanders or any other living creatures can survive lire; but possible 
(though untrue) that unicorns exist, and possible (though improbable) 
that abominable snowmen exist. Modern treatments of modal logic begin 
with Lewis 1912, 1917 and Lewis and Langford 1932, with some anticipa- 
tion by MacColl 1896-7. 41 

Another example of a nonclassical propositional calculus is the 
intuitijpiislicjngpositwnal calculus , in which the law of the excluded middle 
AV~ iA and the law _of double negati on ~nA 3 A are not affi rmed. 
The standpoint from which the intuitionistic system of logic arose and is 
of interest will be considered later (in § 36). We do not attempt here a 
model-theoretic description of the intuitionistic propositional calculus. 
A proof-theoretic formulation is obtained by replacing our Axiom 
Schema 8 (— i— iA D A) by : 8‘. iAD(AD B). Since any axiom by this 
schema is provable in the classical propositional calculus (cf. *10a), the 
intu itionistic propositional calculus is a subsystem of the classical, i.e. all 
formula s provable in the intuitionistic system are pravablejn the classical. 
Thos'e" oTouT" officially stated results involving “ F” (including ones we 
first stated with “t=”) which are not readily established for the intuition- 
istic system also are identified by “°”. 42 

Exercises. 12.1. Establish the first two deducibility relationships of 
Lemma 1 for & and the last two for V. 

12.2*. Prove Lemma 4. 37 


40 Sec Rosser and Turquette 1952. 

11 See von Wright 1951, Feys 1965. 

12 In fact, the results so marked in this book (and in 1M) do not hold for Ihc intuition- 
istic system; but this we are in no position to prove now. A method is suggested in 
§ 54 below which suffices for all such results in this book except Theorem 27, and the 
actual proofs are in 1M § 80. 
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12.3. True or false (and why)? 

(a) “For each formula A: if F — iA, then not F A.” 

(b) “For each formula A: if not F A, then F -iA.” 

(c) “For each formulas A and B: if F A V B, then F A or F B.” 

(d) “For each formulas A and B: if F A or F B, then F A V B.” 

12.4' *. Consider any improvable formula, *e.g. P 3 Q. Adjoin a 
corresponding axiom schema ADBto our present list la-lOb of axiom 
schemata. Show that in the resulting system, every formula is provable. 
(Post completeness, 1921; IM p. 134 Corollary 2.) 

12.5°*. Show that exactly the same formulas are provable in the 
system described in Footnote 29 § 9 with four axiom schemata (after 
allowing for &, V, ~ being symbols of abbreviation rather than “prim- 
itive” symbols) as in our system. 

12.6. Establish weak - i-elimination (in Theorem 13) for the intuition- 
istic propositional calculus. (The other twelve rules of Theorem 13 
without (double) — i-elimination hold for the intuitionistic propositional 
calculus by the same proofs as we gave above.) 

12. 7. 23 Show that: {F “t(A l & ... & AJ} =*{A! A m are not 

simultaneously t for any assignment} = {for some formula B, both 

A|, . . . , A,„ F B and A, A,„ F — iB} = {Ai A m F B, for every 

formula Bj. Hence, if S is the axiomatic-deductive system obtained from 
the propositional calculus by adding A„ . . . , A,„ as axioms: 

(Aj & ... & A,,, is consistent, § 8} = {“A 1; . . . , A,„ are consistent”} = 
{S is simply consistent} = {not every formula is provable in S}. 

§ 13. Proof theory: use of derived rules. In model theory we gave 
one answer to the question “What formulas E shall hold in (classical) 
propositional logic?”: those which have only t’s in their truth tables, or 
in symbols for which F E. In proof theory we gave another answer: those 
for which there are formal proofs using just axioms by Axiom Schemata 
la-lOb and modus ponens, or in symbols for which F E. By §§ 1 1, 12 we 
know the two answers are equivalent. 

Similarly we gave two equivalent answers to the question “What 
formulas B shall follow by (classical) propositional logic from a given list 
of formulas A 1( ...,A m ?”: those for which A t , . . . , A m F B (model 
theory), those for which A,, . . . , A,„ F B (proof theory). 

Neither in model theory nor in proof theory did we stop with these 
answers. Instead we derived various properties of “F” or “F”, which 
often are easier to use than direct applications of their definitions. In this 
section, we shall elaborate on the use of such results, especially the 
“injroducti.oiLt ind elimination r ules” of Theorem 13. We shall give the 
discussion in proof theory (with “F”), though yve could do so in model 
theory (ef. Exercises 7.4 (a), 7.6, 7.7). 
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Wc call the rules of Theorem 13 and similar results derived rules. For 
they arc results which we have “derived” about the axiomatic-deductive 
system, after establishing it by choosing (or “postulating”) the primitive 
or “postulated” rule of inference (modus ponens) and the axiom schemata 
(Axiom Schemata la-lOb). 

In using such rules, and generally in demonstrating the existence of 
deductions, we often find it convenient to write down lists of formulas 
that we can successively recognize as deducible from given assumption 
formulas Aj, . . . , A m . Apart from deductions themselves, one such list of 
formulas is the four formulas ADC, B D C, A V B, C written after 
“F, A V B b” in the explanation of 6 in the proof of V-elimination for 
Theorem 13. Another is in: 

Example 8. With A & B as the assumption formula, we can use 
&-elimination and &-introduction to construct the following list of 
formulas 1-4 deducible from A & B (left column). 

1. A & B — assumption formula. 1'. A & B — assumption formula. 

2. A - &-elim., 1. 2'. A & B D A - Ax. Sch. 4a. 

3. B — &-e!ini., 1. 3'. A — modus ponens, 1', 2'. 

4. B & A - &-introd„ 3, 2. 4'. A & B D B - Ax. Sch. 4b. 

5'. B — modus ponens, 1', 4'. 

6'. B D (A 3 B & A) - Ax. Sch. 3. 

T. A D B & A - m. p„ 5', 6'. 

8'. B & A — modus ponens, 3', 7'. 

Thence we can conclude that A & B I- B & A, i.e. that there exists a 
deduction of B & A from A & B. This list of formulas 1-4 is not itself a 
deduction of B & A from A & B, since it does not exactly fit the definition 
of deduction in §9. The list l'-8' in the right column is a deduction of 
B & A from A & B. 

In Example 8 the use of &-elim. (two rules) and &-introd. provides a 
slight abbreviation compared to constructing the deduction itself; in 
effect these rules provide certain prefabricated units. These three rules and 
eight others in Theorem 13 have the form “B b C” or “B 1? B 2 b C”. They 
assert that we can construct a deduction leading “directly” from B (or 
Bj, B a ) to C; so we call them direct rules (and likewise rules 
“Bj, . . . , B„ b C” for any p > 0). 

The other three rules in Theorem 13 (D-introd., V-elim., -n-introd.) 
enable us, from the existence of one or two “given deductions” or “sub- 
sidiary deductions”, to infer the existence of another deduction (the 
“resulting deduction”); so we call them subsidiary deduction rules (and 
likewise with any number s > 1 of subsidiary deductions). Their use 
is illustrated for D-introd. by Examples 6 and 7 in § 10. Here the 
saving compared to actual construction of deductions is more impres- 
sive. 
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Theorem 9 gives two general principles concerning the construction of 
lists of formulas recognized successively as deducible from given assump- 
tion formulas A,, . . . , A,„. By (i), each of A„ . . . , A,„ itself can be put 
into the list. By (ii), if C is deducible from any formulas B x , . . . . B„ 
already in the list, then C can be put into the list. 

In Example 8 (the left column), we used (i) at Step 1, and (ii) with 
p = 2 at Step 4. 

As a special case of (ii) (with (i)): If B x , . . . , B„ V C, and each of 
Bj, . . . , B„ is one of A x , . . . , A m , then A t , . . . , A,„ I- C. (For then by (i), 
Aj, .... A,„ h B, for i — 1, . . . , p.) Thus any formula C deducible from 
a given list of assumption formulas . . . , B„ is deducible from any list 
A,, . . . , A m which includes all of B,, . . . , B„ and maybe additional 
formulas. Using this for p = 0, any valid formula C is deducible from any 
list of assumption formulas A„ . . . , A,„. Using the same principle in both 
directions, whether A x , . . . , A m I- C holds depends only on what formulas 
occur in the list A x , . . . , A,„ with or without repetitions. 

These consequences of Theorem 9 are also obvious enough directly from 
the definition of “h”. We call Theorem 9 and its consequences "general 
properties of K\ because these properties are independent of the particular 
list of “postulates” we chose (Axiom Schemata I a— I Ob, the D-rule). The 
student should become facile in using them, either directly from the 
meaning of “ K\ or by using Theorem 9. 

In the subsidiary deduction rules of Theorem 13, the list of assumption 
formulas for the (or each) subsidiary deduction differs from that for the 
resulting deduction. So the use of these rules cannot simply consist in 
listing formulas successively recognized as deducible from one set of 
assumption formulas A 1( . . . , A,„. We could construct several such lists, 
for different sets of assumption formulas. An alternative, which we shall 
illustrate presently, is to construct one list of deducible formulas, but 
alter from place to place in the list the set of the assumption formulas 
which are “in force”, i.c. from which the formulas are being claimed to be 
deducible. 

Example 9. 'flic following list (A) of 19 statements is a reproof of 
*55a of Theorem 2 with “(•” in place of “f”. (Of course we already had 
the result by Theorems 2 + 14; cf. end § 12.) The student should have no 
trouble checking (A) step by step. (Here we are no longer mentioning 
Theorem 9 for each use of general properties of K as we did in §§ 1 1, 12.) 

1. — i(A V B), A b A V B — V-introd. 

2. — i(A V B), A I- — i(A V B). 

3. i(AV B) I — lA — — i-introd., 1, 2. 

4. -i(A V B), B b A V B - V-introd. 

5. — i(A V B), B b m(A V B). 
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6. — i(A V B) b -iB — -i-introd., 4, 5. 

7. ~i(A V B) h -|A & ~iB — &-introd., 3, 6. 

8. b -i(A V B) 3 -iA & ~iB - 3-introd., 7. W-*'-*** 4 

9. “iA & “iB, A b A. 

(A) 10. -iA & “iB, A b — iA — &-elim. 

1 1. -iA & "iB, A b — i(A V B) — weak -i-elim., 9, 10. 

12. iA &-iB, B b B. 

13. -)A &~iB, B I — iB — &-elim. 

14. -iA & “iB, B b — i(A V B) — weak -i-e!im., 12, 13. 

15. -iA &-iB, A V B I — i(A V B) - V-elim., 11, 14. 

16. - iA & — iB, A V B b A V B. 

17. -iA & -iB b — 1 ( A V B) — -i-introd., 16, 15. ^ 

18. b -iA &“iB 3 n(A V B) - 3-introd., 17. 

19. b -i(A V B)~ “iA & “iB — <~-introd., 8, 18. M 

However our purpose is not just that the student should be able to follow 
such (informal) proofs, but that he should be able to discover them for 
himself. So we now review how we are led to this proof. We are to 
establish an equivalence ~ i(A V B)~ ~iA & ~ )B (Line 19). Looking at 
the rules provided in Theorem 13, the obvious choice for our purpose is 
~-introd. This requires us first to establish two implications (Lines 8 and 
18). To establish the first of these -i(A V B) 3 -iA & iB (Line 8), the 
obvious method is 3-introd. (the deduction theorem), which requires us 
to get Line 7, i.e. to show the deducibility of — iA & ~ iB from —^A V B). 
By"&-introd., it will suffice to deduce ~iA and -iB separately. So this 
part of our problem becomes to get Lines 3 and 6. For Line 3, to deduce 
-iA the obvious method is -i-introd. (reductio ad absurdum), for which 
we add A to the assumption ~ i(A V B) we already have, and attempt to 
deduce a contradiction, i.e. to obtain — i(A V B), A bC and ~ >(A V B), 
A I — iC for some formula C. (These were written ‘T, A b B” and 
“l\ A b -jB" in the statement of ->introd. in Theorem 13; but the B of the 
rule need not be the present B, so we write it “C”.) it isn’t hard to see that 
we can do this, if we pick for C the formula A V B (Lines 1 and 2). Simi- 
larly, Lines 4 and 5 suffice for Line 6. Now we must pick up the loose end 
at Line 18. The student should try to retrace how we discovered Lines 
9-17 supporting Line 18. This isn’t quite as straightforward. To get 
Line 17, obviously we aim to get “iA & ~iB, A V B b C and “iA & "iB, 
A V B I — iC for some C. It may not be immediately evident what to pick 
for C, but it should be evident that we must utilize AVB; and V-elim. 
(proof by cases) is the rule by which we can hope to do so. So we make 
two “cases” (corresponding to the ‘T, A b C” and ‘T, B b C” of V-elim.), 
i.e. we first replace A V B as assumption formula by A (and later by B), 
and investigate what we can then deduce. Once we have a contradiction 
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within either case, by weak — i-elim. we can deduce anything else we 
please within that case. 

We numbered (A) in the logical order 1-19, in which the statements are 
inferred (or checked); but as we have just seen, we discover them by 
working upward from the bottom more or less (say in the order 19, 8, 18, 
7. 3, 6, 1, 2, 4, 5, 17, . . .). 

The student should practice finding such informal proofs of results 
about formal provability and deducibility, working up from the bottom. 
After some practice, he should become fairly adept at looking ahead. 

Now we give essentially the same proof in a more condensed format 
(Bi). Here we omit writing the symbol “f”. When we wish to employ a 
formula A as assumption formula, we say simply “Assume A”. This means 
that A is being added to the list of assumption formulas for the construc- 
tion of deductions. In this example, all the assumption formulas are 
introduced preparatory to an application of a subsidiary deduction rule 
(D-introd., V-elim. or — i-introd.). When that application occurs, the 
assumption is “discharged”, i.c. no longer remains “in force”. The 
student is expected to be sufficiently familiar with these rules to know 
when an assumption is thus discharged. To help in the discussion, we put 
small numbers before the formulas as they are successively introduced, 
i.c. either assumed or inferred. We number separately reintroductions of a 
formula under new sets of assumptions. 

I. Assume for D-introd., j“i(A V B). Assume for — i-introd., 2 A. 
Thence by V-introd., ;i A V B, contradicting — i(A V B). By the planned 
“i-introd., pA. Assume for another ~ i-introd., 5 B. By V-introd., 
6 A V B, contradicting ~i( A V B). By the “i-introd., 7 “iB. By &-introd., 

lA & ~iB. By the 73-introd., 0 — )(A VB)D ~iA & “iB. 

II. Assume for U-introd., l() ~ iA & ~iB. Assume for — i-introd., 
(B x ) n A V B. Case 1 : 12 A. From “ iA & — iB by &-elim., 13“ iA, contradict- 
ing the case hypothesis. By weak “ i-elim., 14 “i(A V B). Case 2: „B. 
From —iA & ~iB by &-elim., 16 “iB. Again by weak “i-elim., 
17 n(A V B). — By the cases (V-elim.), 18 “i(A V B), contradicting 
A V B. By the “i-introd., 19 — )(A V B). By the I3-introd., 

20 “iA & “iB 13 “i(A V B). 

From I and II by ~-introd., 21 ~i(A VB)~ — iA & “iB. 

We discover (B x ) in essentially the same way as (A), but we have “written 
it up” differently. Thus, we know we have to prove two implications to 
get the desired equivalence (by ~-introd.); and we number as “I” and “II” 
the work we do for these respective implications; For the first, we shall 
use 3-introd., so we assume its antecedent ~i(A V B). We look ahead to 
deducing “iA & “iB by &-introd. from — >A and — jB ; and toward proving 
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the first of these by “i-introd. we assume A; etc. Naturally, one does not 
always succeed in writing out such an informal proof consecutively 
without a little exploratory scratch work. 

We intend (B^ as an abbreviated, and very convenient, way of 
presenting a series of applications of the rules of Theorem 13 (with Theo- 
rem 9). 

To be sure there is no error in this, we should be able to translate (BJ 
into explicit applications of the rules of Theorem 13. We now do this, 
using the idea of a list of formulas successively recognized as dcducible, 
where from place to place in the list the set of assumption formulas from 
which the formulas are being claimed to be deducible may be changed. 

First, we write the 21 numbered formulas in (B|) in a list (B 2 ), as follows. 


(B,) 


v 


1. -)(A V B) - assumed. 

2. A — assumed. 

3. A V B - V-introd., 2. 

4. -iA — m-introd., 3, 1. 

5. B — assumed. 

6. A V B - V-introd., 5. 

7. ~iB — i-introd., 6, 1. 

- 8. iA & ~iB — &-introd., 4, 7. 

9. i(A,V B) D -iA & iB - D-introd., 8. 

10. - iA & iB — assumed. 

11. A V B — assumed. 

12. A -- assumed. 

13. -iA — &-elim., 10. 

14. —i (A V B) — weak — i-elim., 12, 13. 

15. B — assumed. 

16. iB — &-clim., 10. 

17. — 1 ( A V B) — weak i-elim. , 15, 16. 

18. — i(A V B) - V-elim., 14, 17. 

■r 19. — i(A V B) — — i-introd., 11, 18. 

20. - iA & -iB D i(A V B) — D-introd., 19. 

21. — 1 ( A VB)~ — iA & - iB — '-—introd., 9, 20. 


The arrows at the left show exactly how long each assumption formula 
remains in force. Thus, since t i(A V B) is introduced in (B,) in prep- 
aration for the D-introd. which takes place (discharging it) with result 
9 i(A V B) D -iA & iB, the arrow in (B 2 ) beginning at Formula 1 ends at 
Formula 8, the last one for which t i(A V B) is intended as an assumption. 

Now, we replace the arrows by writing in the respective assumption 
formulas themselves followed by the symbol " L”. The numbers are moved 
out of the way to the left (and assumed” is omitted). Thus (B 2 ) becomes 
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(B ;i ), and vve arc back to essentially the same format as (A). Each line in 
(B 3 ) reads correctly as an application of a rule of Theorem 13 (with 
Theorem 9) or simply of (i) of Theorem 9. From the way we have arrived 
at (B ;i ) from (B,) via ( B.,), slight differences have resulted in the exact list 
of statements compared to (A). We note two of these differences. 


1. -i(AVB)fiAVB). 

2. A, — 1( A V B) h A. 

3. A, — 1 ( A V B) h A V B — V-introd. ( 2. 

4. — 1( A V B) h “ iA — — l-introd., 3, 1. 

5. B, -i(A V B) h B. 

6. B, -i(AVB) h A VB - V-introd., 5. 

7. —i(A V B) f iB — “i-introd., 6, I. 

8. ~i(A V B) b “iA & — iB — &-introd., 4, 7. 

9. I — i(A V B) D “iA & “iB — D-introd., 8. 

10. “iA & “iB h -iA & “iB. 


(B 3 ) 11. A V B, “iA &-iB h A V B. 

12. A, A V B, “iA & iB h A. 

13. A, A V B, “iA & “iB h iA — &-elim., 10. 

14. A, A V B, -i A & “iB h “i(A V B) — weak “i-elim., 12, 13. 

15. B, A VB, “iA & “iB f B. 

16. B, A V B, “iA & “iB i — iB — &-elim., 10. 

17. B, A V B, “ iA & “iB I- — 1 ( A V B) — weak ~i-elim., 15, 16. 

18. A V B, -iA & iB h — i(A V B) - V-elim., 14, 17. 

19. “iA & “iB h — i(A V B) — - “i-introd., 1 1, 18. 

20. I — iA & “iB D — 1 ( A V B) — D-introd., 19. 

21. I — >(A VB)~~iA & “iB — ~-introd.,9,20. 


At each point in the argument (B[), any earlier result is available that 
was under only assumptions in force at the moment, i.e. in (B 2 ) that is 
opposite no other arrow. So n(AVB) in Line 1 is available for the 
~ i-introd. in Line 4 of ( B a ). To fit the “i-introd. rule as stated in Theorem 
13, we can understand in (B :) ) that Line I is first rewritten as 
A, — 1 ( A V B) I- — 1 ( A V B) by general properties of H before the “i-introd. 
(By general properties of f, the changed order of assumptions compared 
to the statement in Theorem 13 is immaterial.) 

Tn (Bj), A V B naturally comes to mind as an assumption at Step 11, 
preparatory to the “i-introd. at Step 19. We could suspend the assumption 
of A V B during the two case arguments (Lines 12-14, and Lines 15-17, 
in (B) 2 ). However it is simpler (and does no harm) to keep it in force until 
it is finally discharged. Thereby we can represent the duration of each 
assumption in (B 2 ) by a single arrow. Then the V-elim. for Line 18 gives 
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directly A V B, A V B, ~iA & — iB H — i(A V B); but this simplifies at once 
by general properties of H to A V B, ~iA & ~iB I i(A V B). — 

This concludes Example 9. We believe the student, after he has a 
good grasp of Theorem 13 (and the general properties of I-) will find (BJ 
used flexibly the easiest format. In using this he should have a clear mental 
picture of a (B 2 ), i.e. of just how long each of his assumptions remains in 
force (and of when formulas are reintroduced under new circumstances), 
whence automatically a (B 3 ) can be written that will consist step by step of 
correct applications of Theorem 13 (with general properties of h ). In any 
case of doubt, he should write out (B 3 ) or (A). 

In tackling a new logical problem, the student should of course use any 
previously established results that are available (except here, where to 
give examples and exercises we are asking the student to forget for the 
moment that all the results of Theorem 2 with “ I-” were established at the 
end of § 12). 

Not all of the results in Theorem 2 can be proved by straightforward 
use of the rules of Theorem 13 as illustrated in Example 9. All those not 
marked with “°” (and some which are) can be. 

Example 10. For *55b with “ K\ the implication 
-iA V -iB =) -i(A & B) is easy. But -i(A & B) D -iA V iB requires a 
trick. After assuming ~i(A & B), we undertake to deduce 
— i — 1 ( — iA V -iB), from which by (double) n-elim. we will be able to pass to 
-iA V -iB. So for — l-introd. we further assume ~i(— iA V -iB). Thence by 
the result (or the argument) in Example 9 (*55a), we can deduce 
— i — iA & — i— iB, and the rest is straightforward. 

In contrast to the “direct methods” employed in Example 9, here at the 
crucial point we use the “indirect method” which consists in obtaining a 
contradiction from the negation of what we want. 

Rather than handling *55b etc. as above (except for practice), we might 
as well now add the replacement theorem (Theorem 5) and the method 
of chains of equivalences to our tools in proof theory. Here we can do so 
on the basis of having them in model theory with ‘T”(§§4, 5), and the 
equivalence of 1= to h (§§ 1 1, 12). With these, as we have already seen in 
§ 5, all the rest of *55a-*6l can be established, once we have *49, *55a 
and *55c.- 

In particular (supposing *49 was established meanwhile): 

I — iA V — iB ~ — i — i( — iA V — iB) [*49] ~ — 1 (— i— »A & tiB) [*55a] ~ 

-|(A & B) £*49]. — 

Pragmatically, the student who is well versed in direct use of the rules 
of Theorem 13 (e.g. Example 9) and the chain of equivalences method 
(end Example 10), and is prepared if he runs into trouble to use the 
indirect method (beginning Example 10), will be well equipped to deal 
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efficiently with problems of provability and deducibility in the classical 
propositional calculus. * 

In the intuitionistic propositional calculus (end § 12), we don’t have 
double “ 1 -elimination, so the indirect method is unavailable; and for 
constructing chains of equivalences of course only equivalences holding 
in the intuitionistic system can be used in the links. 

Exercises. 13.1. Use Theorem 9 (i) and (ii) to justify the general 
properties of I- used in Example 9 (A) Steps 1, 2, 7. 

13.2. Show that, if f A m+1 , then A„ . . . , A m b B if and only if 
Aj, • • • , A,„, A w4 _i b B. 

13.3. Show that B I- C if and only if: for every list A 1( . . . , A m of 
formulas, if A lt . . . , A,„ b B then A,, .... A,,, b C. 

13.4. The following proof of *59 uses results coming earlier in Theorem 
2 (supposed already proved with “b”). Some obvious steps are tacit. 
Supply the tacit steps, convert the result successively to the formats (B 2 ) 
and (B :t ) (as in Example 9), and check (B 3 ): 

I. Assume ADB. By *51, A V -iA. Case 1: A. Thence (by 3-elim. 
from A 3 B) 13, whence by V-introd. ~iA V B. Case 2: ~iA. By V-introd., 
“iA V B. 

II. Assume “iA V B. Case 1 : ~iA. By *10a, A 3 B. Case 2: B. 
By Axiom Schema la, A 3 B. 

13.5. Use the rules of Theorem 13 to establish with “ b” in place of “b”: 
* 1 2a, *35, *40. *49, *55c. 

13.6. Taking *55a with “b” as already established (by Example 9), use 
an indirect proof to establish *51 with “b”. 

13.7. True or false (and why)? 

(a) "Q V F, Q 3 -iR b Q V (~iR ~ P).” 

(b) “S-T,TD(S3Q), -iR3-iP b P V S 3 R V Q.” 

13.8. Which of the four statements 

“ b A O B 3 C ~ (A 3 C) A (B 3 C)”, where O is & or V, and A is 
& or V, hold for all formulas A, B, C? Similarly, which of 
“ b C 3 A O B ~ (C 3 A) A (C 3 B)”? 

13.9°. Establish: 

(a) T b Aj 3 (. . . (A„, 3 B) . . .) if and only if, for some formula C, 

both T, A, ,. . . , A m , -iB b C and 1\ Aj A m , iB b ~iC. 

(b) A & B 3 C, ~iD 3 — i(E 3 F), C 3 (E 3 F) b A 3 (B 3 D). 

(c) A 3 B, C 3 -iB, (D 3 — iA) 3 C b -lE V A 3 (E 3 D). 

* § 14. Applications to ordinary language: analysis of argu- 
ments. In this section we shall treat some points concerning the 
application of the classical propositional calculus to reasoning in ordinary 
language (English). 
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A full procedure for solving logical problems arising in verbal form 
would be first to translate the sentences concerned into the symbolism 
of the propositional calculus, 43 and second to apply the theory and 
techniques of the calculus (as developed above) to the resulting formulas. 

In simple reasoning, we can apply the calculus without first explicitly 
translating. We have been reading & as “and”, D as “implies” or “if . . . 
then . . .” or “only if”, and “i as “not”. So we can, almost without 
conscious effect, apply simple properties of D, &, V, “> directly in 
verbal form. Many of those properties we must already have been applying 
in verbal form, since all of us have been using propositional calculus from 
when we first learned to talk. However, our seeing logical principles stated 
succinctly with the aid of symbols may help to fix them as part of our 
mental apparatus. Thus the formal study of logic may reinforce and 
extend o ur native facilit y. ~~ ‘ " 

Example 1 1 . Consider the following argument in words. Letters are 
suggested in brackets to symbolize prime components of the composite 
sentences. “1 will pay them for fixing our T.V. [P] only if it works [W]. 
But our T.V. still doesn’t work. Therefore I won’t pay them.” This 
argument can be symbolized thus: 

(1) P D W, “iW “>P. 

To say that this is correct reasoning (disregarding whether or not the 
premises P D W and ~iW are both true in fact) should mean that whenever 
PDW and -iW are both true, then ~iP is also true. This is what we 
expressed exactly in § 7 by saying that ~ iP is a valid consequence of 
P 3 W, — iW, or in symbols by 

(2) PDW, — iW 1= -iP. 

Accordingly, just in this case we will say the argument (1) is valid. By 
§§ 1 1 and 12, (2) is equivalent to 

(3) PDW, — iW b -iP. 

In establishing validity, we shall generally use this latter form, tacitly 
employing the consistency theorem (Theorem 12). (In establishing 
invalidity, we usually deal directly with (2).) We now establish (3) by 
writing down in succession formulas deducible from PDW, ~iW, 
beginning with those formulas themselves, until we reach “iP (as in 
Example 8, etc.): 1. PDW. 2. -iW. 3. mW D -tP [from 1 by contra- 
position *I2a (with Corollary Theorem 5, or and D-elim.)]. 4. ~iP 
[from 2 and 3 by modus ponens (D-elim.)]. 

43 Or to construe them as formulas in our symbolism, if the object language is verbal 
(cf. § 1 top p. 6). 
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Example 12. “If he doesn’t tell her [ — iT], she’ll never find out [ iF]. 
If she doesn’t ask [ — lA], he won’t tell. She did find out. So she must have 
asked.” We put this in symbols as follows. 

( 1 ) -iT 3 ~iF, — iA 3 — iT, F A. 

Now we establish that the argument is valid, or equivalently that 

(3) — iT 3 — iF, ~nA 3 — iT, F f A. 

1. -]T 3 -iF. 2. -iA 3 -iT. 3. F. 4. F3T [contraposition *12a, 1], 

5. T 3 A [*12a, 2]. 6. T [modus ponens, 3, 4], 7. A [modus ponens, 

6, 5], — Alternatively: 4. “lA 3 -iF [chain inference *2, 2, I (and modus 
ponens twice)]. 44 5. F 3 A [*12a, 4]. 6. A [modus ponens, 3, 5]. 

Example 13. “The Governor will retain the support of labor [L] only 
if he signs the bill [S], He will keep the farm vote [F] only if he vetoes it 
[V], Obviously, he must either not sign the bill or not veto it. Therefore 
the Governor will lose either the labor vote or the farm vote.” 

(1) L3S, F3 V, -iSV-iV ;. - 1 LV- 1 F. 

To establish 

(3) L3S, F3V, -lSV-lV bnLV-iF, 

by cases (V-elim. in Theorem 13), it will suffice to establish (3a) and (3b) 
below, each of which we do directly in parentheses (by listing successive 
consequences of the underlined assumptions). 

(3a) L3S, F3 V, ~iS I — iL V — iF 

(L 3 S, ~iS, — iS 3 — iL, -iL, “iL V -iF). 

(3b) L3S, F3 V, -iV f-iLV-iF 

(F 3 V, — iV, -iV 3 -iF, — i F, — iL V ”iF). 

The demonstrations of (3) in Examples II and 12 may appear long 
because we wrote out the reasons; we omitted the reasons in demonstrating 
(3a) and (3b) of Example 13. We hope the reader is gaining enough 
facility to write down without hesitation lists of formulas demonstrating 
deducibility relationships directly (when simple direct demonstrations 
exist). 

We hesitate to say whether a person’s having seen contraposition 
(*12a) stated as a law of logic promotes his using it with more fluency and 
sureness than before, in examples like the three foregoing. That will 
depend on how proficient he already was. But it seems likely that a 

11 Chain inference (A3B, 13 3 C t A 3 C) is sometimes culled “hypothetical 
syllogism” in traditional logic. 
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person who hadn’t previously studied logic will find in the collection 
of results in Theorems 2 and 13 (with all the supporting material) some 
useful logical principles which he wasn’t already using effectively. 

Of course our lists of principles could be extended. The result of 
Example II constitutes a principle of inference (A D B, "iB I — lA) 
called “modus (tollcndo) tollens” in traditional logic. The inference 
A V B, - lA h B (which we get by first using *61 to change A VB to 
-lA 3 B and then modus (ponendo) ponens) is called “disjunctive 
syllogism”. The whole result of Example 13 

(ADC, B D D, “tC V — iD h “iA V — iB) is called “destructive dilem- 
ma”, as the Governor can appreciate. We omitted these and others from 
our compilations above only because they are such immediate consequences 
of principles we did include. The student may add them and any other 
correct principles he wishes to his working lists of principles and rules. The 
foregoing theory (especially § 13) should make it easy to prove such addi- 
tional results as he desires. Different persons will have different habits and 
preferences as to which principles they use directly and which they 
assemble out of others when needed, and these habits may change with 
time and with the problems at hand. 

„ Now, whether or not logic studied formally actually augments our 
,7 native capacity to di scover correct arguments , it certainly is of value in 
j checking the correctness ol' proposecfarguments. For, i t provides an anal y- 
sis of the bas is of reasoning with both models f model theory) and norm s 
for correct- reasoning (proof th eory). Thus we can appeal to formal logic 
to confirm the correctness of our reasoning or to detect errors in it, when- 
ever we find ourselves in risk of becoming confused. And if we have not 
found ourselves prone to confusions in reasoning, we have no doubt 
noticed that others sometimes arc, especially our adversaries in arguments. 

ExAMi’Lii 14. “He said he would come [C] if it doesn’t rain [ iR] 
(and we can depend on what he says). It's raining. Therefore he won’t 
come”. In symbols, 

(1) -iR D C, R -iC. 

To try to confirm this by establishing ~)R D C, R I — iC, we naturally 
try modus ponens. But we would need — iR to use that directly. If we first 
contrapose ~iR D C (by *12a, simplifying “V iR to R by *49), we obtain 
-iC D R, and we’re no better off. Thus what was spoken as though it 
were an obvious one-step inference (“Therefore”) doesn’t follow by any 
obvious principle. If we hadn’t studied truth tables, we would already be 
able to say to the speaker that we don’t see how her conclusion follows, 
and strongly suspect that it doesn’t follow. But we have studied truth 
tables. Giving R, C the values t, t, both the premises (assumption formulas) 
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— iR D C, R become t, while the proposed conclusion ~iC becomes f. 
Thus not -iR DC, R1 ~iC. So the conclusion really doesn't follow. 
“Maybe he’ll come anyway. Or, did he rather say, ‘I’ll come only if it 
doesn’t rain’?”. 15 

In examples like these, we admit to laboring the obvious. But it is not 
so easy in long chains of deductive reasoning to be sure one has nowhere 
gone astray. In polemical arguments, the speaker may be deliberately 
attempting to lead his audience to a conclusion which his assumptions do 
not strictly justify. 

To check any separate step in a chain of reasoning, if it is correct, should 
be quite easy, once the reasoning has been clearly formulated. Translation 
into logical symbolism (carried out partially or fully as the circumstances 
may require) forces a resolution of any ambiguities or obscurities there 
may be in the verbal presentation. It is outside the scope of this book to 
treat exhaustively the topic of verbal argumentation. 40 But we shall note 
a few aspects of it. 

The principles we are noting here will come up again in the predicate 
calculus (Chapter II), where there are more ways of going astray if one is 
not explicit and careful. 

An example of a simple argument which we cannot adequately analyze 
now, but will be able to in the predicate calculus, is the following: “All 
men are mortal. Socrates is a man. Therefore Socrates is mortal.” As we 
recall from § 1, the propositional calculus is concerned only with logical 
relationships which result from the way sentences are built from certain 
sentences (called by us prime formulas or atoms) which enter as unanalyzed 
wholes. In this example, all we can say now is that the foregoing argument 
has the form P, Q R; and P, Q 1= R does not hold. We give this example 
as a warning of what not to attempt in the way of logical analyses at this 
stage. 

We recall that, for the classical propositional calculus, it is assumed that 
each unanalyzed sentence (or atom) is either true or false but not both ; 
and we are to use no more than this (§ 2). In classical mathematics, this 
assumption is considered as holdi ng strictly (§ 36). In dai hUilg^ it is 
notoriously the case th c>t proposit ion s do noJ LillataysJ 'all neat b^into two 
classes, the true and the false. We may tell the three dozen people w'e are 
inviting to a picnic, “If the weather is good [G], the picnic will be this 

In verbal examples, we suspend our convention of using only P, Q, R, . . . , P,, 
P 2 , P 3 , ... for prime formulas (atoms), to allow the letters chosen for the prime formulas 
to match the words (as here, C for “he will come”), We already did this in Examples 
11-13; but those three applications of propositional calculus did not depend on the 
formulas symbolized being prime. 

4,1 Cf. Clark and Welsh 1962. 
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Sunday [S]; otherwise, the next good Sunday [N]”. Then when Sunday 
comes, the weather is not good, and everybody knows not to show up. 
(G3S)&("iG3N), iG b N. Only, often it isn’t this simple. Sunday 
morning the weather outlook is very ambiguous. We have to decide 
whether to go through with the picnic (risking that everyone will be 
miserable); or to postpone it (and have the family spend half the week 
eating up the picnic hamburger and rolls). 

Nevertheless, the classical propositional calculus is very useful in daily 
life. It helps us to be precise in our logic, even when we must recognize 
there is an element of guesswork in the assumptions. Thus we may phrase 
a number of statements, none of which we are absolutely sure is true, but 
which represent our best appraisal of the facts. Then we would like to 
know exactly what will follow. Or we may wish to consider several alterna- 
tive sets of assumptions, perhaps attaching probabilities to them; then, 
before deciding upon some course of action, we deduce by two-valued 
logic what consequences follow exactly from each set of assumptions. 

These rather obvious remarks are to emphasize that one should not lose 
sight of the role of his logic in the total intellectual situation. Our allusions 
to modal and intuitionistic propositional calculi (end § 12) illustrate that 
there are situations which call for other kinds of logic than the classical. — 
In case of doubt about a piece of verbal propositional reasoning, trans- 
late it into the symbolism of the propositional calculus. 

Here is a list (not exhaustive) of expressions on the right which can 
(or often can) be translated by the symbols on the left. 

A ~ B. A if and only if B. A iff B [an abbreviation]. 

A if B, and B if A. If A then B, and conversely. 

A exactly if B. A exactly when B. A just in case B. 
A is (a) necessary and sufficient (condition) for B. 

A is materially equivalent to B. 

A is equivalent to B [sometimes]. 

A 3 B. If A, then B. B if A. 

When A, then B. B when A. 

In case A, B. B in case A. 

B provided that A. 

A is (a) sufficient (condition) for B. 

B is (a) necessary (condition) for A. 

A materially implies B, A implies 

A & B. A and B. Both A and B. 

A but B. Not only A but B. 

A although B. A despite B. A yet B. 


A only if B. 

A only when B. 

A only in case B. 


B [sometimes]. 


A while B. 
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A V B. A or B or both. A and/or B [in legal documents], 

A or B [usually]. Either A or B [usually], 

A unless B [usually], A except when B [usually], 

(A V B) & ~i(A & B) A or B but not both. A or else B [usually], 
(equivalently by A or B [sometimes]. 

Exercise 5.3 (b): Either A or B [sometimes], 

— 1( A B), — lA ~ B, A unless B [sometimes], 

A ~ — ! B). A except when B [sometimes], 

— 1 ( A V B) (equivalently by *55a, - iA & ~ iB). Neither A nor B. 

~iA. Not A [or the result of transforming A to put “not” just after 

the verb or an auxiliary verb], 

A doesn’t hold. A isn’t so. 

It is not the case that A. 

In translating English words by our propositional connectives with two- 
valued truth tables, shades of meaning that are present in the English 
words may be lost, at the same time that precision for the purpose of logic 
is gained. 

Although in the propositional calculus, A & B is equivalent to B & A, 
the report “Jane had a baby and got married” will strike Jane’s friends 
differently from “Jane got married and had a baby”. 47 In this example, 
the order of the conjunctands suggests a temporal (or causal) succession. 
The temporal succession can be rendered in classical logic, if we wait until 
wc have the symbolism of the predicate calculus. But the translation 
A & B is simpler, and suffices for logical analyses in which the temporal 
(or causal) component doesn’t matter. 

Expressions like “A but B”, “A although B” and “A despite B” have 
nuances of meaning not possessed by “A and B”, and lost in the transla- 
tion A & B. The young man may respond differently if his girl friend tells 
him “I love you and I love your brother almost as well” than if she says 
“I love you but I love your brother almost as weII".‘ ,B 

Although we propose to translate “A unless B” ordinarily by A V B, 
which is equivalent to BVA, one would say “I won’t go unless she 
apologizes” but hardly “She apologizes unless I won’t go”. 4 ” This is 
another example in which the English suggests a temporal or causal 
succession. 

A further difficulty in translating is that there may be ambiguity as to 

17 Example from Strawson 1952 p. 80. 

” Example from Suppcs 1957 p. 4. 

19 Example from Clark and Welsh 1962 p. 45. 
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what meaning reduced to terms of two-valued truth tables is intended. 
When the dinner menu says, “Tea or coffee included”, we are not surprised 
to be charged extra if we order both. If it is announced that donations of 
books will be received at the school or at the church, we won’t expect our 
books to be refused at the church because we have already left some at the 
school. Since the inclusive “or”, symbolized by V, is the more useful one, 
we personally are accustomed to using “or” inclusively (not bothering to 
add “or both”); and always adding “but not both” or the like in any cases 
of doubt when we do want the exclusive “or”. If A and B are such that 
— i(A & B) is known or assumed anyway, then the inclusive and the exclu- 
sive “A or B” are equivalent, and one might as well use the simpler transla- 
tion A V B. (Indeed, ~i(A & B) h (A V B) & ->(A & B) ~ A V B.) Thus in 
a lecture to a mathematical audience, when 1 say “« is even [A] or n is an 
odd prime [B]”, it is immaterial whether the sentence be understood as 
A V B or as (A V B) & ~i(A & B). But if I were speaking to an audience 
of people who did not know that a number cannot be both even and an 
odd prime, it could make a difference which way my “or” was understood. 
Although it is farfetched to suppose such an audience would have invited 
me to speak about /?, a similar situation is less so. We might be analyzing 
an argument involving the sentence “« is even [A] or n is an odd prime 
[B]”. Then if the premises do not include enough facts about the number 
system so that — 1 ( A & B) is deducible from them, it could be necessary for 
the argument to be valid either to use the translation (A V B) & ~ >(A & B) 
instead of simply A V B, or to add — j(A & B) or something that entails it 
to the premises (whereupon (A V B) & — i(A & B) A V B becomes 
deducible from them). 

The meaning of “A unless B” and “A except when B” which we prefer 
and use ourselves is that B is an “escape clause” which lets us off from 
our assertion of A in the case of B; i.e. in saying “A unless B” we intend 
to claim that A when ~iB, and to make no claim when B. Then ~iB 3 A, 
or equivalently A V B, is the translation. But one must be on guard to 
recognize when a speaker intends to claim that — iA when B; then the 
translation is (~iB 3 A) & (B 3 -jA), or equivalently (by Exercise 5.3 (b)) 
(A V B) & — i(A & B), so “unless” and “except when” in this usage amount 
truth-functionally to the exclusive “or”. 

In ordinary language, we don’t use parentheses to indicate how the 
parts of a composite sentence are to be associated, and often fairly subtle 
clues serve instead. “If Jones is present [J] or Williams speaks up for our 
proposal [W] and Stark doesn’t come out against it [ — iS], it will be adopted 
[A].” Should we translate this as (a) (J V W) & iS D A or as (b) 
JV(W &~iS)o A? In written language a comma before “and” would 
resolve the ambiguity in favor of (a); in spoken language, we can indicate 
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(a) by putting “both” before “Jones” or repeating “if” before “Stark”, 
and accenting “and” (after a slight pause). 

To summarize, the process of translating from English to symbols is 
not automatic. The translator must first seek to understand clearly the 
passage to be translated. If it is his own words, he must choose the inter- 
pretation he intends. If it is the words of another, he must undertake to 
divine that person’s intentions when the words are ambiguous, using any 
clues the context may provide; and he may even need to try out different 
translations to see which makes the best sense. 

In translating from symbols to English words, if we choose from the 
tops of our lists of translations we can hardly go wrong in rendering the 
connectives, but care will still be needed to be sure that the scopes 
of the connectives are unambiguously rendered. The result should then 
be unambiguous, but may be far from being idiomatic or in good 
style. 

Exercises. 14.1. Translate each of the following arguments into 
logical symbolism, and analyze the result in the manner of Examples 11-14. 

(a) If he belongs to our group [B], he is virtuous and trustworthy [V & T]. 
He does not belong to our group. Therefore, either he is not virtuous or 
he is not trustworthy. 

(b) Unless taxes are raised [R], there will be a deficit [D]. If there is a 
deficit, state services will be curtailed [C]. Therefore, if taxes are raised, 
state services will not be curtailed. 

(c) If he is responsible for this rumor, he must be either stupid or un- 
principled. He is neither stupid nor unprincipled. Therefore he is not 
responsible for the rumor. 

(d) If the suspect committed the robbery, either it was planned very care- 
fully or he had an insider as confederate. If it was planned very carefully, 
then, if he had an insider as confederate, more loot would have been taken 
than was taken. Therefore the suspect is innocent. 

(e) If peace breaks out, then there will be a depression unless the country 
rearms with new weapons or carries out a massive domestic program in 
education, conservation, antipoverty, etc. It will not be possible to get 
agreement on what a massive domestic program should be. Therefore, if 
peace breaks out and there is not to be a depression, the country must 
rearm. 

(f) The proposed attack will succeed, only if the enemy is taken by surprise 
or the position is weakly defended. The enemy will not be taken by sur- 
prise, unless he is overconfident. He will not be overconfident, if the posi- 
tion is weakly defended. Hence the attack will not succeed. 

(g) Unless we continue price supports, we will lose the farm vote. Over- 
production will continue if we continue supports, unless we institute 
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production controls. Without the farm vote, we cannot be reelected. 
Therefore, if we are reelected and do not institute production controls, 
over-production will continue. 

(h) If (1) x + 3 = V3 — x, then x 2 + 6x + 9 = 3 — x. But 

x 1 + 6x -f 9 = 3 — x if and only if ( x + 6)(a: + 1) = 0, which is the case 
if and only ifx=— 6orx = — 1. Therefore only —6 and — 1 can be roots 
of the equation (1); i.e. x + 3 = V3 — x implies that x = —6 or x = — 1. 

(i) Like (h), except changing the conclusion to: Therefore —6 and —1 are 
roots of (1); i.e. x = —6 implies x + 3 — V3 — x, and x = — 1 implies 
x + 3 — V3 — x. 

14.2. (Keisler.) Brown, Jones and Smith are suspected of income tax 
evasion. They testify under oath as follows. 

Brown: Jones is guilty and Smith is innocent. 

Jones: If Brown is guilty, then so is Smith. 

Smith: I’m innocent, but at least one of the others is guilty. 

Let B, J, S be the statements “Brown is innocent”, “Jones is innocent”, 
“Smith is innocent”, respectively. Express the testimony of each suspect 
by a formula in our logical symbolism, and write out the truth tables for 
these three formulas (in parallel columns, like (h)-(l) § 7). Now answer 
the following questions. 

(a) Are the testimonies of the three suspects consistent? (Exercise 12.7.) 

(b) The testimony of one of the suspects follows from that of another. 
Which from which? 

(c) Assuming everybody is innocent, who committed perjury? 

(d) Assuming everyone’s testimony is true, who is innocent and who is 
guilty? 

(e) Assuming that the innocent told the truth and the guilty told lies, who 
is innocent and who is guilty? 

*§ 15. Applications to ordinary language: incompletely stated 
arguments. In daily life and public affairs, it is common for arguments 
to be given in which the intended premises (or assumption formulas 
A l , . . . , A m ) are not all of them explicitly stated. It would be beside the 
point to chide a speaker by saying that his argument is not valid because 
not Aj, . . . , A p l= B where A x , . . . , A p express the premises he stated, 
when it would be fair to assume that he intended further premises 
A fl+1 , .... A m to be understood. Arguments that are intended to have 
such tacit assumptions A p+1 , . . . , A„, may be called enthymemes. Tradi- 
tionally, the term related to the inferential patterns (or syllogisms) of 
traditional logic. As logic has now become more flexible, it is natural to 
extend the term to cover any argument in which one or more premises or 
the conclusion is tacit. 
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Enthymemes have a proper and weli-nigh indispensible role. Without 
them communication would become exceedingly slow and tedious. We can 
properly omit what should be obvious; we'll quickly lose our audience if 
we don’t. A premise may be obvious for an argument, because it’s well 
known and universally accepted, or because we have recently been talking 
about it. But also, if it’s to be omitted without obscurity, what is left of 
the argument should more or less clearly indicate that it’s called for as a 
premise. Indeed this alone can be sufficient license for leaving it tacit. 
Thus I could say to a hostess who didn’t already know that I planned to 
retire early, "Iff drink coffee [C], I can’t get to sleep early [— iS]. So please 
don’t pour me any”. The enthymematic argument (before supplying the 
missing premise) is 

C 15 -IS -iC 

It's clear enough that this is an abbreviation for 

C 3 iS, S -i C. 

In the black art of persuasion, enthymemes may be employed to detract 
attention from a premise whose truth the hearer might doubt. 

As an enthymeme with the conclusion tacit, if 1 have just been offered 
a cup of coffee, simply C 3 -iS would be clearly enough an abbreviation 
for C 3 -iS, S /. ~iC. If we communicate statements that (perhaps with 
others that are obvious) would constitute the premises for the inference of 
a conclusion that we prefer not to state baldly, we are engaging in innuendo. 

Thus logical analysis comes to include attempting to supply premises 
(or conclusion) for what is ostensibly an incompletely stated argument. 
In some cases, the form of the argument may leave no room for doubt 
about what is to be supplied. In other cases, we may have to experiment 
with different trials of unstated premises A„ tl , . . . , A,„ in the attempt to 
find a set that will make the argument valid; and we may find more than 
one such set. 

Hence it is appropriate to pause and consider more carefully what an 
argument “A!, . . . , A,„ B” is. When someone says "A 1( . . . , A,„ B”, 

he doesn’t simply mean that B is a valid consequence of A t , . . . , A,„ (in 
symbols, A,, . . . , A,„ f B). He also intends to claim that A,, . . . , A,„ are 
true (or at least available as though true). Thus the full meaning of 
“Aj, . . . , A,„ B” is “(i) A„ . . . , A m are true, and (ii) A x , . . . , A m 1= B; 
and therefore B is true”. The purpose of the argument is to persuade the 
hearer of the truth of B on the grounds (i) and (ii). When both (i) and (ii) 
hold, we call the argument not simply “valid" but sound. 

Whether A, A m are true or not may be a matter of empirical fact, 

or of belief, or may rest on earlier assumptions under which the argument 
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is being pursued and which make A lt . . . , A m available for the purpose 
of the argument. Soundness is thus relative to whatever criteria or stand- 
ards are being presupposed in the claim of the argument that A x , . . . , A m 
are available; and a full statement on the matter of soundness would 
include such reference. Also it seems convenient to recognize graduations 
by calling an argument simply plausible when it is valid but we can only 
say that A 1( . . . , A m are plausible. 

Now if there are different choices of unstated premises that will make 
an enthymematic argument valid, it will be of interest to see if a choice 
can be found for which they are all true (or at least all plausible). Whether 
or not the proposed premises A u .... A ni are all true is of course ordi- 
narily not a question of logic. 

If f -^Aj & ... & A m ) (A 1 & . . . & A m is inconsistent , beginning § 8), 

then “A x A„, .'. B” is unsound simply by logic (cf. Exercise 12.7). 

An opponent’s argument will be destroyed, if his premises are shown to 
be inconsistent, or to be “inconsistent with the facts”, i.e. to become 
inconsistent on adding to them other statements known to be true. 

Similarly, if f A x & . . . & A,„ (A x & ... & A m is valid). A,, .... A m are 
all true on logical grounds; so then, if “A 1( ...» A m B” is valid, h B. 

It is in the remaining case, that neither h “ifA! & ... & A m ) nor 
h A, & . . . & A m (A t & . . . & A m is contingent , beginning § 8), that the 
soundness of a valid argument “A x , . . . , A m .’. B” depends on considera- 
tions outside of logic. 

In discourse, it may not always be clear when an argument (perhaps 
enthymematic) is intended. Here are some phrases which when inserted for 
the blank in “A x , . . . , A„, ; — B” indicate an argument “Aj, . . . , A m B” ; 

therefore, hence, whence, thence, so, it follows that, we infer that, 
we conclude that, consequently, but then, these imply, thus. 

The following phrases have the same effect when inserted for the blank 
in “B; — A 1( . . . , A m ”: 

this follows from, in consequence of, for, this is implied by, 
because, since, in as much as. 

The word “implies” requires special comment. According to the con- 
text and the user, “A implies B” may mean (I) “if A then B” or in symbols 
A3B, or (II) “B follows by logic from A” or in symbols “A h B”, 
which is equivalent to “A I- B”, to “t= A Z3 B”, and to “1A3B”. In 
brief, “implies” may be translatable by (I) or (II) either of “N” and 
“h”. Clearly, an implication in Meaning (1) is a statement in the object 
language; in Meaning (II), a statement in the observer’s language. 11 
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Under (I), the truth of “A implies B” will depend ordinarily on cir- 
cumstances outside of logic, e.g. on matters of empirical fact; hence the 
name “material implication” for A3B, Under (II), “A implies B” is 
a “logical implication”, being true exactly when A D B is t for all assign- 
ments of t's and f’s to the prime components of A D B; in later chapters 
and elsewhere, this will generalize to “PAD B” in whatever system (in 
place of propositional calculus) is under consideration there. Some writers 
(beginning with Quine 1940) avoid (I), and call A Z> B a “conditional” 
rather than an “implication”. 

Similarly, “A is equivalent to B” may be translated by (1) A ~ B (“A is 
materially equivalent to B”) or (II) either of “P A ~ B” and “ P A ~ B” 
(“A is equivalent to B in the system to which P and P refer”). Those who 
call A 3 B a conditional call A~Ba “biconditional”. 

In this book, we generally avoid Meaning (II) of “implies” (finding “P” 
and “P” more convenient and explicit). But sometimes we use Meaning 
(II) of “equivalent”, indicating explicitly or by the context the reference 
to a system (cf. beginning § 5, end § 8). 

It is more or less customary in definitions to write simply “if” or “when” 
or “in this case” meaning “if and only if” or “equivalent”. (We did so in 
the definitions of “valid consequence” §7, of “inconsistent” and “con- 
tingent” in §8, of “provable”, “deduction” and “deducible” in §9, and 
in effect in the definition of “valid” in § 2 by saying “Such” meaning 
“Such and only such”.) — 

The verbal examples above all led to quite trivial logical problems when 
they were translated into symbols. 

It is easy enough to make up problems in which the logical part would 
be more challenging. All we need to do is to take some chain of arguments 
(in which conclusions of earlier arguments become premises of later ones), 
and give only the original premises (i.e. those not conclusions of earlier 
“links”) and the final conclusion. Such problems that are both ingenious 
and entertaining have been constructed by a number of logicians, above 
all by C. L. Dodgson (= Lewis Carroll) 1887, 1897. Whether they belong 
to the serious study of logic, or only to the department of mathematical 
puzzles and recreations, is a matter of one’s educational philosophy. 

We offer some reflections, without claiming definitely to answer the 
question. 

First, consider the role of logic in checking the validity of arguments 
already constructed. (In § 14, we took this second.) The person giving an 
argument, whether in ordinary conversation, or in a debate or a trial, or 
in mathematical exposition, is supposed when he says “therefore” to be 
making a step which his hearers or readers can reasonably be expected to 
follow. Thus it is in the nature of an individual argument that it should be 
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simple (or if it is fallacious, that it should pretend to be simple). It is the 
arguer’s job, if he is seriously claiming to demonstrate something, to break 
his demonstration into a series of chunks that can each be swallowed. If 
the individual arguments or inferences (chunks) get too complicated, we 
can accuse him of not having given us a demonstration. Of course, how 
big a step can reasonably be justified by “therefore” is a function of the 
audience’s proficiency in logic, and familiarity with the subject matter. 
Individual arguments in the chain may be expressed enthymematically; 
and in an extended piece of exposition (like a text book) successions of 
individual arguments that recur may be combined into individual argu- 
ments after the audience has had time to become familiar with them. We 
conclude, however, that “arguments” consisting of a list of premises, 
followed by a conclusion (with “therefore” prefixed) that can only be 
inferred by a long succession of steps of kinds with which the audience 
can be expected to be familiar, are rather artificial. Such “arguments” do 
not naturally occur. 

Of course, the arguer may elect not to give his demonstration in full on 
a given occasion. He may omit some steps that the audience is not supposed 
to be able without pausing to supply (which omissions we would not 
consider enthymematic), saying “It can be proved that . . (It is poor 
style to substitute the phrase “Obviously . . .”.) Then we have the question 
whether it is the reader’s (or hearer’s) job to try to fill in the missing steps 
(which could constitute an extended logical problem of the sort we are 
discussing). 

A similar situation arises if one of the allegedly simple arguments in a 
chain of arguments offered to us is fallacious. Refuting that individual 
argument by a choice of truth values doesn’t settle whether the whole 
demonstration, taken as an argument from the original premises to the 
final conclusion, is valid or not. Is it our job to make a repair, or to demon- 
strate the impossibility of repair? Of course, if a trifling change will fix 
matters, we would feel silly not to have noticed it. 

But how much work we should be expected to do in either of these 
situations would depend on our stake in the matter. Primarily, it is the 
responsibility of the person who would demonstrate something to supply a 
demonstration, broken into reasonably sized pieces, each individually valid. 

We now come to the other role of logic, which is to discover demon- 
strable results and demonstrations of them. We aren’t just looking for valid 
formulas and valid arguments. We are interested in the fact that P D P 
is valid; but much less so in the facts of the validity of each of the formulas 
of the infinite collection of which 


— I— l— I— l(P D P) & (P D P) & — I— I(P D P) & — 1— I— l— i— 1— i(P D P) 
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is another member. A catalog simply listing all the valid formulas found 
among the formulas up to some given length would not sell. Whether the 
validity of a formula or an argument interests us depends on whether it 
has some bearing on actual affairs, or may play a role in some logical or 
mathematical theory, which may in turn interest us for practical or theoret- 
ical reasons. So a question whether 1= E or whether A„ . . . , A m k B is 
only likely to interest us if we have some reason for wanting the result that 
the conjecture does (or does not) hold. This reason may be the possible 
applicability of the result to questions that previously interested us, or it 
may be a reason suggested by the form of the conjecture itself, or it may 
be that in settling the conjecture we anticipate developing interesting 
methods. Moreover such a reason often provides hunches why the con- 
jecture may hold, and thus clues as to how to go about trying to 
establish it. 

For these reasons we are rather skeptical that complicated logical prob- 
lems, given in vacuo (with motivation or clues), are very relevant to the 
use of logic, even to its use in discovering quite complicated proofs. But 
a very firm grasp on the individual rules of logic, the simple principles, 
repeated applications of which are used to construct complicated proofs, 
is essential. Practice with these principles can of course be obtained in 
using them to derive still other logical principles, which in turn may be 
useful, and in applying them in the course of deducing interesting results 
in mathematics, science and daily life, instead of in working extended 
made-up logic problems. So we regard the logical theory and applications 
in the following chapters as in part a substitute for more problem work 
here. 

We conclude this chapter with one more example, to illustrate a domain 
of logic in which complicated problems may naturally arise. This is in 
simplifying logical expressions. We refer to the theory given in § 8. 

Example 15. (Venn 1881 p. 261.) A certain club has the following 
rules: (1) The Financial Committee shall be chosen from amongst the 
General Committee. (2) No one shall be a member both of the General and 
Library Committees unless he be also on the Financial Committee. (3) 
No member of the Library Committee shall be on the Financial Com- 
mittee. Simplify these rules. Solution. Let x be any person (member of 
the club, presumably). Let P be “* is on the Financial Committee”, Q be 
“x- is on the General Committee”, R be “ x is on the Library Committee”. 
Then ( 1 )— (3) are expressed by (P Z> Q) & (~i(Q & R) V P) & i( R & P) 
(Exercise 8.2 (c)). This has the conjunctive normal form (§ 8) 

(— i iQ V — i R) & (Q V — iP), which is equivalent to (Q3"iR)&(P3Q). 
Thus a simpler set of rules is (1) with: (2') No member of the General 
Committee shall be on the Library Committee. 
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Exercise. 15.1. Premises (or conclusion) may be missing in the 
following. If so, attempt to supply them to produce a valid argument. 
Do you consider the (completed) argument sound (or at least plausible)? 

(a) The accused could be guilty of the crime [G] only if he was in New 
York at 6 p.m. on January I [N], But it has been established that he was 
in Washington at that time [W], Therefore he is not guilty. 

(b) We have no proof that he committed the crime. Therefore he must be 
acquitted. 

(c) We have no proof that he committed the crime. Therefore he is 
innocent. 

(d) If it has snowed, it will be poor driving. If it is poor driving, 1 will be 
late unless I start early. Indeed, it has snowed. Therefore 1 must start early, 

(e) If you use our “Gallant Tailor Spray”, you will not be troubled by 
insects. 
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§ 16. Linguistic considerations : formulas, free and bound occur- 
rences of variables. In the propositional calculus, we studied those 
logical relationships which depend on how some propositions are com- 
posed from other propositions by operations (expressed by the sym- 
bols D, &, V, -i) in which the latter propositions enter as unanalyzed 
wholes. In the predicate calculus , we carry the analysis a step deeper to 
take into account also what in grammar is called “subject-predicate struc- 
ture”, and we use two further operations, V (“for all”) and 3 (“for some” 
or “there exists”) which depend on that structure. (The predicate calculus 
includes the propositional calculus.) 

Consider the proposition (expressed by the sentence) “Socrates is a 
man”. The part of this proposition (expressed by) “ — is a man” or “x is 
a man” is a predicate; “Socrates” is a subject. Read “a; is a man”, using 
the mathematical notation of a variable, the predicate is seen to be a 
propositional function , i.e. for each value of the (independent) variable x , 
it becomes (or takes as value) a proposition, true for example when x is 
Socrates, false in Greek mythology when x is Chiron, and in the Kleene 
household when x is Fleck. To take another example, “John loves Jane” 
is a proposition, which can be thought of as a value of any one of three 
propositional functions, “ x loves Jane”, “John loves y" and “x loves y . 
In grammar, "x loves Jane” is a predicate, but not “John loves y ” or 
“x loves //”; and a value of “x” is a subject, of “«/” an object. Mathemati- 
cally, these distinctions are unimportant. We shall simply adopt the term 
predicate as short for the more cumbersome propositional function 
P(x l , . . . , x„), for any number n > 0 of (independent) variables; 50 and the 
term object or individual for a value of any one of the variables. For n = 0, 
we have a proposition as a special case of a predicate; for n = 1 a property; 
for 11 = 2, a (binary) relation; for n — 3, a ternary relation; etc. 

This explains the name predicate calculus for the logic of propositional 

50 The term " independent variable” arises from the mathematical practice with 
functions like x 2 + 3x + 1 or sinx of writing “;/ = x 2 + 3x + 1” or “?/ = sin x” 
where “//” is another variable (the dependent variable) assuming the values of the 
function. We shall not do thus here. 
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functions. A fully descriptive (but cumbersome) name is calculus of propo- 
sitional functions . 8I 

The notation “ x is a man” is a bit more concise than “ — is a man”. The 
advantage of variables over blanks to show the “open places” in a predicate 
is increasingly apparent in examples like “ — j loves — 2 ”, “ — t loves — ,” 
(synonymous with “ — loves himself”), and “ — j is father of — 2 , or • — 1 is 
mother of — ,” (synonymous with “ — , is a parent of — 2 ”). 

Such expressions as “ — , loves — 2 ” or “x loves y" are not used directly 
in ordinary language. 88 To relate them to ordinary language, we may begin 
by regarding the blanks or variables which we have introduced as “place 
holders” for words naming objects. Of course, the words to be supplied 
do not need to be proper nouns, like “John” and “Jane”. We can have, 
for example: (aj “Somebody loves Jane”, (a 2 ) “There is someone who 
loves Jane”, (b) “Nobody loves Jane”, (c) “Everybody loves Jane”, (d) 
“Everybody loves someone”, (e) “Someone is loved by everybody”, (f) 
“Everybody loves himself”, (g) “There is no one who does not love him- 
self”. Using “L(x, ?/)” as short for “x loves y", and supposing for the 
moment that “bodies” or persons are the range of the variables ‘V’ and 
these can be written using V and 3, thus: (a') 3xL(x, Jane), (b') 
- i3xL(x, Jane), (c') VxL(x, Jane), (d') Vx3yL(x,y), (e') 3t/VxL(x, y), 
(f) VxL(x, x), (g') ~i3x— iL(x, x). In this symbolism we have formed sen- 
tences expressing propositions, without displacing or completely displacing 
the variables “x” and “y". Note particularly that in (a') as expressing (a^ 
(which is synonymous with (aj)), “someone” and “who” are each repre- 
sented by an (occurrence of) “x”; similarly, three different words of (g) 
are each represented in (g') by an “x”. 

Here, rather than thinking of variables as simply (or always) “place 
holders”, they may be thought of as a stock of names (nouns or pronouns), 
which are available to us to name various objects. What objects are named 


51 The name functional calculus is often used. This name has seemed to us a little 
unhappy (though historical precedence can be claimed for it), as it leaves out of account 
the most descriptive part, that the functions are propositional, Thus it might suggest 
functions from numbers to numbers (tike ar 2 + 3x + 1 and sin x, where for each number 
as value of "x’\ x 2 + 3x + 1 and sin x become numbers) ; these functions are commonly 
called simply "functions” when propositional functions are being called “predicates”. 
A further possibility of confusion is with functionals, in the sense of functions from 

functions of the last sort to numbers (e.g-J” f(x)dx, which, for each suitable function 

like x 2 + 3x + 1 or sin x as value of/, becomes a number); the branch of mathematics 
called “functional analysis” deals with these. 

52 Except as the mathematical notation of variables has come to be adopted in such 
examples as “A loves B”, “If A docs this, B will do that”, with “A”, “B”, "C”, ... as 
variables for persons. 
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may depend on how they arc incorporated into sentences, or on the con- 
text in which those sentences appear. 

The use of variables is thus not basically so very different from con- 
structions that are used in ordinary language. “Somebody” and “every- 
body” serve as names for unspecified persons; and even the proper name 
“Jane” isn’t specific, unless we have explained that we mean Jane Austen, 
or Jane Grey, or Jane Addams, or Jane who lives down the street. The 
legal “John Doe” is the equivalent of a variable (ranging over men) whose 
value is being left unspecified. 

In the propositional calculus we studied logical relationships between 
propositions without taking into account what propositions are expressed 
by the prime formulas. This comes to the same thing as saying that, for 
the logical relationships we dealt with there, the propositions expressed 
by “P", “Q”, "R”, . . . could be any propositions. Likewise, here we shall 
not identify the objects or individuals which may constitute values of the 
variables. To keep the symbolism as simple as we can in this chapter, we 
shall' also not assume any other list of names for individuals to be provided 
than the one list of variables. So, if we wish to symbolize (a x ) “Somebody 
loves Jane”, we shall have to write e.g. 3xL(x, y) (or 3xL {x,j)) and agree 
that “i/” (or “/’) is a name for the person Jane. This is not too great a 
sacrifice here, since we shall only be concerned in this chapter with general 
logical relationships, in which the special charms of Jane won’t figure 
(or at least only to the extent they arc enumerated, and the relationships 
will then apply to any other lady possessing all those same charms). Later 
(§ 28), some symbols different from variables may be provided as names 
for particular individuals. 

We return to the matter of notation for predicates. To take a mathe- 
matical example now, ”.r < ;/” can be used to name'a predicate. Then when 
(.r, y) become or take as values (2, 7) or (5, 100), x <y becomes or takes 
as value a true proposition. When ( x , y) become (7, 7) or (100, 5), x < y 
becomes a false proposition. Now if we say (h) “For every (real) number x, 
there exists a number y such that x < y" (or in our new symbolism, (h') 
V.r3// x < y), the “x < y" as part of (h) or (h') is not being used to name 
a predicate, but to say something about two numbers, the first an arbitrary 
(i.c. completely unspecified) number named by “x”, and the second a 
suitably chosen number named “y” (the choice depending on the number 
named This example illustrates that it is necessary (but we think, 
easy) to distinguish between the use of an expression like "x < y" to name 
a predicate, and its use to express a proposition which is the value of that 
predicate when “x”, “?/’ are being thought of as’ naming objects. 53 It is 

•>» If those objects are unspecified, the proposition expressed then by “x < y” may be 
called the "ambiguous value” of the predicate. Cf. IM pp. 33, 227. 
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only necessary to keep clearly in mind that a predicate P(x, y) is not a 
proposition, but a correspondence (or correlation) by which, from the 
various choices of pairs of objects as values of “x”, “y”, respective propo- 


sitions arise. 

So long as “x < y" is used just to name a predicate, it is quite satis- 
factory to think of “a;” and “?/” as place holders, not (or not yet) naming 
anything. But mathematicians, without changing “ x " and "y" to any- 
thing else, then often go over to thinking of “a;” and “y” as naming objects; 
so by a transformation in interpretation, the expression “ x < y" for a 
predicate becomes an expression for a proposition, as we have illustrated. 
It is the ease with which this transformation can be made that accounts 
in part for the popularity of variables as a notational device. 

The predicate (— propositional function) named by “* < y" can also be 
named simply Similarly, the function named “sin x" can be named 

simply “sin” or “sine”; but there is no commonly used name for the 
function z 2 + 3.r + 1 that does not contain (or some other variable 
instead). The vast majority of predicates we shall wish to name will not 
have commonly used names without variables. 

A notation for a predicate using variables we may call a name form for 
the predicate. The variables used as part of such a notation constitute the 
name form variables in that notation, and have the predicate interpretation 
or the name form interpretation. We postpone further discussion of the use 
of variables in expressing propositions. 

We began our study of the propositional calculus in § 1 by assuming that 
we are dealing with one or another object language, and that in this object 
language there are some (declarative) sentences (expressing propositions) 
which are to retain their identity throughout any particular investigation 
in the propositional calculus but are not to be analyzed in the investigation. 

Now, to get started on our study of predicate logic, we need to assume 
that the object language contains some expressions or linguistic construc- 
tions for predicates (of given numbers of variables), which expressions shall 
retain their identity throughout any particular investigation in the predi- 
cate calculus but shall not be analyzed. These expressions we call prime 


predicate expressions or ions, and we denote them by “P ’, ‘P( ) , 

“ P “P(-,-,-r;?. . , “Q”, “Q(-r, “oc- -), “Q(-, 

“R'\ . . . , also using subscripts when convenient. 54 Each capital 


oi There is some analogy with chemistry, where positive one-atom ions are atoms with 
some electrons missing. When the places for those electrons are filled by electrons, the 
ions become atoms, just as when the places for names in prime predicate expressions 
are filled by names of objects they become prime formulas. (For the present, our only 
names are variables.) 

A chemist friend suggested “neucleons” as more appropriate; but, since the analogy 
with chemistry will be imperfect anyway, we prefer the shorter word “ions”. 
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Roman letter from the latter part of the alphabet will be used as a name 
for a different n-place prime predicate expression or ion for each number 
„ > 0 of variables; thus P, P(— ), P(— , — ), P(— , — , — ) are four different 
ions (expressing respectively a 0-, 1-, 2- and 3-place predicate), and Q, 
Q( Q(_ , — ), Q( — — ) are four other ions. By including n = 0, 
we allow P, Q, R, . . . , expressing propositions, as in the propositional 
calculus: i.c. any atoms we had there which we do not now analyze further 
are allowed as the ii = 0 case of //-place ions. 

As in § 1, we are remaining silent here about what exactly the under- 
lying object language is, both because we do not wish to be drawn into 
details about it now, and because we wish to leave the way open to various 
applications. The object language may be a symbolic language constructed 
by logicians using logical symbols, and perhaps also mathematical sym- 
bols; or it may be a suitably restricted and regulated part of English or 
some other natural language, without or with mathematical symbols 


added. Now ire need to operate 
the small Roman letters “a”, “ 


with variables; and we now agree to use 
b , c , . . . , x , y , z , a x , a 2 , 


“ a ,” “x,”, “x.,”, “x 3 ”, ... as names for variables (or their equivalent 

as in (a, )-(g) above) in the object language. Our convention here is that 
distinct small Roman letters will be names for distinct variables (or their 


equivalent) in the object language, except when we say they need not be 


distinct. 55 

Sometimes it will be more convenient to consider the predicates expressed 
by the ions (with blanks as place holders) as named by expressions with 
variables (as above we had a choice between t loves — f and “x loves 
.//”). We call P(x, y, z), P(y, z, x), P(u, v, w), etc. different name forms for 
the same 3-place prime predicate expression or ion P( — , — , — ); the vari- 
ables x, y, z or y, z, x or u, v, w are the name form variables in these name 
forms, and they are to be distinct (as in the three examples). We also call 
P(x. y, z), P(y, z, x), P(u, v, w) prime predicate expressions (or ions) with 
attached (name form) variables. In a particular logical analysis, one name 
form for a given ion is the most we will need; however we may need to 
choose the name form variables in it to avoid conflict with variables 


already being used in other ways (e.g. in Exercise 19.1 below). We do not 
allow P(x, x, y), P(x, y, x) etc. as name forms for a 2-place prime predicate 
expression; for these are not prime, but exhibit that the predicate named 
arises by identifying two of the variables of a 3-place predicate expressed 


In fact, we shall reserve “r”, “r,’\ “r,’\ “r 3 ’\ ... as names for variables that need 
not be distinct from each other and the other variables present. Under our convention 
in this chapter, however, we shall repeat this each time they are used. 

Later we shall be obliged to use some of the Roman lower case letters for other 
purposes than naming variables (§§ 28, 38, 57). 
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by the ion P( — , — , — ). Because in a 3-p!ace prime predicate expression 
we exclude considering any further structure (otherwise it wouldn’t be 
prime for us), the blanks in P(— , — , — ) are to be filled independently, 
so we don’t need subscripts thus: P(— t , — 2 > —a)- Similarly, as a prime 
predicate expression, “ — loves — ” must be “ — t loves — 2 ”; and “ — < — ” 
(or simply “<”) r>u must be “ — , < — 

Now we are ready to describe a class of sentences assumed to exist in 
the object language called formulas, just as we were in § 1 after introducing 
the atoms there. But here we are starting further down in the structure 
of the object language (or assuming the object language has more than the 
minimum structure assumed there); i.e, we must now start with the ions. 

For each //-place ion P( — , and each choice of variables rj 

r„ not necessarily distinct, P(rj, . . . , r„) shall be a prime formula or atom. 
For example, from the ion P(— , — , — ), we obtain as atoms P(x, y, z), 
P(y, z, x), P(u, v, w), P(x, x, y), P(x, y, x), P(u, u, u), etc. From the ion P, 
we get just P as atom. From Q( — ) we get Q(x), Q(y), Q(u), etc. (With 
n — 0, the atoms of the propositional calculus are again atoms here. With 
n > 1 , the atoms are a more extensive class than the name forms of ions, 
since in the atoms the variables replacing the blanks need not be 
distinct.) 57 

The formulas shall comprise exactly the prime formulas or atoms, and 
the additional formulas ( composite formulas or molecules ) constructible 
from atoms by repeated introductions of the logical symbols 23, &, V, 
-i, V, 3, thus. If A and B are any formulas (either prime formulas, or 
composite formulas already constructed), then A ~ B, A 13 B, A & B, 
A V B and -lA are ( composite ) formulas. If A is any formula , and x is any 
variable, then VxA (read “for all x, A”) and 3xA (read “for some x, A’’ 
or “(there) exists (an) x (such that) A”) are ( composite ) formulas. 

56 Since “<” is a permanent notation for a binary predicate. Likewise, in a discussion 
where P(— , — ) but not P, P(— ), P(— , — , — ), . . . occur, we can write simply “P” for 
“P(— , — )” without ambiguity; etc. 

57 So long as one is working only in the predicate calculus, as treated in this chapter, 
“P(x>”, “P(x, y)’\ “Q(x, y, z)’\ .... and the similar expressions “A(x)”, “A(x, y)”, 
“B(x, y, z)”, . . . with capitals from the beginning of the alphabet (introduced below), 
can be simplified by omitting parentheses, or commas, or both thus: “Px”, “Pxy”, 
“Qxyz”, “Ax”, “Axy”, “Bxyz”. We should do so if we planned to dwell on the predicate 
calculus for a long time. However, we plan to advance rapidly to more complicated 
systems, where the “arguments” may not be simply variables x, y, z, but also for example 
5, 12, xy (= x • y) etc., which would render the notations without commas ambiguous, 
or even ones that would render the notations without parentheses difficult to read if not 
actually ambiguous. So we prefer to keep the parentheses and commas that are usually 
(but not invariably) taken as part of the notation for functions in mathematics. (The 
student is welcome to omit them in this chapter, if he won’t have trouble putting them 
back later.) 
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Vx is called a universal quantifier, and 3x an existential quantifier. 
The quantifiers act as unary operators in building formulas, and 
with our other unary operator ~ i are ranked last under the convention 
for omitting parentheses. Thus, “VxA 3 B” means (VxA) 3 B, not 
Vx(A 3 B). 5 " 

So that there will be no ambiguity as to what the atoms are, we stipulate 
that none of them be of any of the seven forms A ~ B, A 3 B, A & B, 
A V B. ~iA, VxA, HxA which the molecules have. Also each atom shall 
come unambiguously from just one ion. In brief, the internal structure of 
the ions (whatever it may be) shall be such that it cannot get mixed up 
with what is added in building formulas and shown explicitly in our 
symbolism for formulas. 

For example, " — is a man”, “ — loves — ”, “— = — ”, “ — < — ”, 

“ 1 = — ”, “2 • 2 = 4” could be ions (with 1, 2, 2, 2, 3, 0 places, 

respectively). Then “x is a man”, “// is a man”, “x loves y", “x loves x”, 
"x = //”, “t/ = y'\ “x + y = z”, “x + x = y" , “2 • 2 = 4”, etc. will be 
atoms; and these can include “Socrates is a man” and “Chiron is a man”, 
or “John loves Jane”, if we interpret “x” and “y” as names for Socrates 
or Chiron, or John and Jane, etc. Examples of molecules would then be 
“x is a man and x loves y”, “x loves y or x loves z”, “for some x, x loves y" 
or 3xL(x, //), etc. as in (a')-(h'). 

In § 1 we emphasized the distinction between our use of P, Q, R, . . . for 
distinct prime formulas and of A, B, C, . . . for any formulas not neces- 
sarily distinct or prime. Here we are again so using A, B, C, . . . ; and 
presently we shall so use A(x), A(y), B(x, y), etc. These capitals from the 
beginning of the alphabet, with or without variables, will be names for 
formulas built up from P, Q, R, . . . , variables, parentheses, commas, and 
the logical symbols — -, 3, &, V, — l, V, 3; and they can be names for the 
same or different such formulas. 

In integral calculus, | x 2 // ilx is not a quantity that depends on x, 
Jo •» 

though it does depend on y. Similarly, 2 x "l , >' does not depend on n, 

n— 0 

though it does on x. We can express this by saying that in the first expres- 
sion “x" is a bound variable and “//” is free: in the second, “/;” is bound 

and “x" is free. In “3x + x 2 // r/x”, the first occurrence of “x” is free 

J o 

and the other two are bound, while "//” is free in both occurrences. (The 
notation in the l bird example is unambiguous, though some people would 

The predicate calculus (or what it adds to the propositional calculus) is sometimes 
called "quantification theory”. In the literature, “(Vx)", “(x)”, “An”, “I In" are often 
used for our “V\"; “(3x)”, "(Lx)”, “V*”, “11 n” for "3x”. 
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prefer “3* + t 2 y dt'\) An example not presupposing calculus is the 

least i) such that 2 ij > x"\ here “a:” is free and y is bound. 

Similarly, we have bound and free variables or occurrences of variables 
in the predicate calculus, where the two operators which bind variables 

are the quantifiers Vx and 3x (rather than $ ... dx or ]£ or ^ ie * east 2/ 
such that”). 5 ” 

Consider the formula 

(1) Vx(P(x) & 3xQ(x, z) 3 3yR(x, y)) V Q(z, x). 

In the part 3xQ(x, z), each x is bound by the 3x, which we can indicate 
by attaching a subscript , to. these two x’s to show that they belong to- 
gether. Similarly we can indicate by subscripts 2 and 3 the variable occurren- 
ces bound by 3y and Vx, respectively. Note that, since the x in Q(x, z) is 
already bound by 3x, it is not free in the part P(x) & 3xQ(x, z) D 3yR(x, y) 
on which the Vx operates (call that part the scope of that Vx; cf. Example 1 
in § 1), so the Vx cannot bind it. In supplying the subscripts we therefore 
always work from the inside out, following the order of the steps by which 
the formula is built up from its atoms P(x), Q(x, z), R(x, y), Q(z, x). To 
standardize the method of numbering, we can further agree to work at 
each stage on the leftmost “eligible” quantifier, i.e. on the leftmost quanti- 
fier whose scope contains no other quantifier not yet treated. In this manner 
we obtain 

( 1 a) Vx 3 (P(x 3 ) & 3x l Q(x 1 , z) 3 3y 2 R(x 3 , y 3 )) V Q(z, x). 

The variable occurrences not thus receiving subscripts (two of z and one 
of x) are free. As another example, consider 

(2) Vy(P(y) & 3xQ(x, z) D 3zR(y, z)) V Q(z, x). 

Supplying subscripts, we get 

(2a) Vy 3 (P(y 3 ) & 3x l Q(x 1 , z) D 3z 2 R(y 3 , z 2 )) V Q(z, x). 

Erasing the bound (occurrences of) variables in (la) and (2a) gives the 
same expression from both, namely 

(lb), (2b) V 3 (P( 3 ) & 3 iQ( i, z) =3 3 2 R( 3 , 2 )) V Q(z, x). 

s» Hilbert and Bernays 1934, 1939 and some other authors use different letters for 
free and for bound variables, say "a", “b’\ “c”, ... for free occurrences only and 
“■c”, "y”, “z”, ... for bound occurrences only (contrary to the practice in informal 
mathematics). The writer did so during a decade in teaching the material of Part II of 
IM, before changing in 1946; he is now convinced that using one list of variables for 
both free and bound occurrences has a slight but definite advantage. 
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This illustrates that the two formulas (!) and (2) are congruent. They are 
not congruent to (3), (4) or (5) (below); for, upon supplying subscripts 
and erasing the bound variable occurrences, we would obtain expressions 
(3b), (4b) and (5b) each differing from (lb). For formulas of moderate 
length, it may be easier to use lines (instead of subscripts) to indicate 
which quantifiers bind which variable occurrences, thus: 


(1) 

Vx(P(x) & 3xQ(x, z) 3 3yR(x, y)) V Q(z, x). 
( 1 1 

l ~l l 1 

(2) 

Vy(P(y) & 3xQ(x, z) 3 3zR(y, z)) V Q(z, x). 

i i i 

it i — n 

(3) 

Vx(P(x) & 3 xQ(x, z) 3 3xR(x, x)) V Q(z, x). 
II 

1 H 1" '1 

(4) 

V/(P(z) & 3xQ(x, z) 3 3yR(z, y)) V Q(z, x). 
i 1 _J 1 

(5) 

Vx(P(x) & 3xQ(x, z) => 3yR(x, y)) V‘Q(z, y). 


The student can compare these figures to determine congruence or incon- 
gruence, disregarding the bound variable occurrences or imagining them 
erased. In (3) the seventh variable occurrence (an occurrence of x) is bound 
by the third quantifier (the second 3x), whereas in (I) the seventh variable 
occurrence (an occurrence of x) is bound by the first quantifier (the Vx). In 

(4) the fifth variable occurrence (an occurrence of z) is bound by the first 
quantifier (the Vz), whereas in (1) the fifth variable occurrence is free. In 

(5) there are no such differences from (I) in the bound variable occurrences, 
but the hist variable occurrence is a free occurrence of y, whereas in (I) it 
is a free occurrence of x. 

Of course, if two formulas are not the same to within the choices of 
variables, they are incongruent anyway. 

Each formula expresses a predicate of the number of variables which 
occur free in it (those variables serving as name form variables). For 
example, is a man”, “ x+x = x”, L{x, x), 3xL(x, y) express predicates 
of one variable; “ x < y", "x < y V x = L{x, //), 

Vx(P(x) & 3,\Q(x, z) Z> 3yR(x, y)) V Q(z, x) express predicates of two vari- 
ables; and "2 • 2 = 4” expresses a predicate of 0 variables, i.e. a proposition. 
(Formulas can also be considered as expressing predicates of more 
variables. Thus “2 • 2 = 4” also expresses a constant predicate of one 
variable, or of two variables, etc.; and L(.r, x) expresses a predicate of 
two variables .»•, y which is constant in the second variable y.) 



DOMAINS. VALIDITY 


83 



But as we remarked above in our discussion of notation for predicates, 
by interpreting the variables as standing for particular objects the formulas 
come to express not the predicates but propositions taken as their values. 
(There are other ways in which a formula with free variables can be inter- 
preted as expressing a proposition, as will be discussed in §§ 20, 38.) 

The formulas (1) and (2) express the same predicate, with z, x as the 
name form variables, while (3) and (4) express two other predicates, again 
with z, x as the name form variables. (These three predicates depend on 
what predicates the ions P( — ), Q( — , — ), R( — , — ) express.) This should 
be evident now from the words proposed for reading the quantifiers; and 
in § 17 it will be further emphasized. Formula (5) expresses the same predi- 
cate as (1) and (2), but using instead z, y as the name form variables. If we 
should prefix Vx to (I), (2) and (5) after enclosing them in parentheses 
(to give Vx their wholes as scope), the third resulting formula would not 
be congruent to the first two; indeed, the first two would express one- 
variable predicates, the third still a predicate of two variables. If we inter- 
pret x, y, z as names for objects, then (5) will express a different proposition 
than (1) and (2) if the objects named by x and y are different. 

Exercise. 16.1. Show by subscripts (or lines) which quantifiers bind 
which variable occurrences. Which pairs of formulas are congruent? 

(a) Vz3y( P(z, y) & VzQ(z, x) 3 R(z)). (f) 3zVx(P(z, x) V VzQ(x, y, z)). 

(b) Vx3y(P(x, y) & VyQ(y, x) 3 R(x)). (g) 3zVx(P(z, x) V VxQ(x, y, z)). 

(c) Vy3z(P(y, z) & VzQ(z, xj 3 R(y)). (h) 3yVx(P(y, x) V VxQ(x, y, z)). 

(d) Vz3x(P(z, x) & VzQ(z, y) 3 R(z)). (i) 3yVx(P(z, x) V VxQ(x, u, y)). 

(e) Vy3z(P(z, y) & VzQ(z, x) 3 R(y)). (j) 3xVz(P(x, z) V VuQ(u, y, x)). 


§ 17. Model theory: domains, validity. We are now at the stage 
corresponding to the beginning of § 2 in Chapter 1. There we said that, for 
the classical propositional calculus, each atom (or prime formula) is 
assumed to express a proposition that is either true or false but not both 
(but which is the case is not a datum for the propositional calculus). 

Now, for the classical predicate calculus, we sh'alFwislTTo make a cor- 
responding assumption about each ion (or prime predicate expression). 
But the first step toward properly talking about the «-place predicate 
( = propositional function of n variables) expressed by an n-place ion is 
to consider what objects m ay be values of the variables, o r in mathem atical 
terminology what are the ranges of the variables. In examples like “x is a 
man” or “x loves y", a meaningful statement, and thus one expressing a 
proposition which classically we can regard as true or false, will not 
necessarily result whatever noun is substituted for “x” or whatever nouns 
for “.c” and "y". It is debatable whether "x loves y" becomes false or 
simply meaningless, when we substitute for and “y" names of veget- 
ables. Furthermore, in ordinary language one can find borderline cases 
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when it is not clear whether anything is named by an expression ostensibly 
serving as a noun. We shut off debate on these issues, at least for now, by 
assuming that there is a particular nonempty set or collection of objects, 
called the domain D, over which each of the (independent) variables of our 
propositional functions ranges; i.e. the members of D are the objects to be 
allowed as values of the variables. 

This is not at all a trivial assumption, since it is not always clearly 
satisfied in ordinary discourse. In mathematics likewise, logic can become 
pretty slippery when no D has been specified explicitly or implicitly, or the 
specification of a D is too vague. 

In saying that D shall be the range of each variable of our propositional 
functions, we mean that the predicate expressed by the ion P( — , — ), or 
by its name form P(x, y), becomes a proposition, or'as mathematicians say 
is "defined”, for each pair of values chosen for x and y from the set D\ 
and similarly with P(x lt . . . , x„) for any n > 0. (For n = 0, D is not 
involved.) For example, the predicate x < y can’t be used when D is 
the set of the complex numbers a + bi , since x < y isn’t defined (meaning- 
ful) for every pair x, y in that D. The predicate x < y can be used when 

D is the real numbers, or the natural numbers 0, 1, 2 But then 

\'r = y is not “always defined”. In none of the three D's just mentioned 
is .r -r // = z always defined (it isn’t defined when y — 0). 8U 

For our study of the predicate calculus in general (i.e. without a stated 
further restriction), we shall consider that we do not know what nonempty 
set D is. In other words, we undertake to develop the predicate logic 
applicable to any nonempty D whatsoever. Thus we are excluding from 
the sets as possible choices of D only the empty set; i.e. there shall be at 
least one object in the range of our variables. (E.g. D cannot be the real 
roots of the equation x l + 1 = 0.) 61 

In mathematics, often different variables are employed with different 
ranges, like x, ?/, z, . . . ranging over real numbers and in, n,p, . . . over 
natural numbers. To keep matters as simple as possible, we are not pro- 
viding for this now; all our variables are to have the same range D (though 
what this D is can be varied in our applications). It is not difficult, after 
the predicate calculus has been treated thus (as one-sorted) to proceed to 
predicate calculus with two sorts of variables, some with one range D x 
and some with another D 2 , called two-sorted predicate calculus. Similarly, 
with more than two sorts. 82 

611 la such situations, mathematicians often get around the difficulty, if they need to, 
by extending the D or extending the definition of the predicate on the original D. 

61 The predicate calculus with the empty domain allowed presents some differences 
(that on the whole are not advantages) compared to the treatment here. Cf. Mostowski 
1951a, Jaskowski 1934. 

82 See 1M pp. 179- 180. and references given there (or, for higher-order predicate 
calculi, Hilbert and Ackermann 1949, Church 1956). 
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§ 17 

Another form of predicate calculus treats the ions as variables which 
may also be quantified, so VP, VQ, 3P, 3Q, . . . become part of the sym- 
bolism. This gives a more considerable extension of the predicate calculus 
(as we study it), called second-order predicate calculus; and on iteration, 
higher-order predicate calculi.' 12 To distinguish the form of the predicate 
calculus which we treat from these, it may be called restricted or lower or 
first-order predicate calculus. 63 

Now we make one more assumption, for the classical predicate calculus, 
as we intimated we would do paralleling the treatment in § 2. This assump- 
tion is that, for each pair of values of x, y taken from D, the proposition 
which results as value of P(x, y) is either true (t) or false (f) but not both. 
(However we are not told which is the case.) Considering that this happens 
for each x, y in D , it comes to the same thing to say that there is correlated 
to the ion P(— , — ) or its name form P(x, y) a function I(x, y) which, for 
each pair of values ofx, y in D, takes either t or f as value (in mathematical 
terminology, a function l(x, y) from D x D to {t, f}). 04 Such a function 
I(x, y) we call a ( 2-place ) logical function. Similarly, to each ion P(X], . . . , x„) 
with n attached variables, an /(-place logical function I(x l , . . . , x„) is cor- 
related. In the case n = 0, i.e. simply P, the logical function l(x 2 , . . . , x„) 
is simply a t or f, as in the propositional calculus. 

The truth tables given for ~, D, &, V, -i in the propositional calculus 
(§ 2) shall again apply. We also define now the process of evaluating VxA 
and 3xA. We shall have occasion to evaluate these only when we are 
already in a position to evaluate A for each choice of a member of D as 
value of x at its free occurrences in A, or briefly when we can evaluate A 
by a logical function of x. We define VxA to be true (t) if this logical 


63 Our one kind of variables may be called “individual variables” or “object variables” 
to emphasize that they range over the domain D of individuals or objects. This is in 
contrast to e.g. “predicate variables” in second-order predicate calculus, or in first- 
order predicate calculus with a postulated substitution rule for such variables. 

64 Here we are deviating from our usual notational conventions, under which we 
should (and occasionally will) write instead: a function I(x, y ) which, for each x, y in D 
(or for each pair of values of “x” and "y" in D) takes either t or f as value. 

For, it is convenient to use the same variables x, y in our name “l(x, y)” for the logical 
function as in the name form P(x, y) for the ion P( — , — ) to which it is correlated or 
assigned as value. So here, where the actual variables in the formula P(x, y) are un- 
specified, we use our names “x” and “y” for them in our name "I(x, y)” for the logical 

function. Similarly, with P(xj x„) and “l(x,, .... x„)” (and rarely in naming 

predicates expressed by formulas). 

Then, in the computations or evaluations (below) we shall simply substitute our 
names for logical functions (“I^x)”, "I 2 (x)", “(,(x, y)”, etc.), for truth values (“t”, “f”) 
and for members of D (“1”, “2”, etc.) into the formulas of the object language (or into 
our names for them). So there we manipulate an object language augmented by “bfx)”, 
“t”, “1”, etc. (We did this already with “t”, “f” in Chapter I; and we shall do the like, 
with also functions having values in D, in Chapter III §§ 28, 29, etc.) 
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function has t for all its values, otherwise false (f) ; and 3xA to be t if this 
logical function has at least one t among its values, otherwise f. 

Now, can we compute a truth table for any formula E? To begin with, 
D , though supposed fixed, is unknown. Actually, only the number D (> 0) 
of members of D (still unknown) matters.® 5 

Exampi.I: I. For illustration, however, let us suppose D is a domain of 
two objects, which for convenience we write simply “1” and “2”; i.e. 
D = {1, 2}. Take as E the formula P(y) V Vx(P(x) D Q). To compute a 
truth value for this, we must start from an assignment consisting of a 
logical function of one variable ranging over D as value of the ion P( — ) 
or its name form P(x), a truth value (or logical function of zero variables) 
as value of Q, and a member of D as value of the free variable y; i.e. we 
shall compute a table to be entered from these three quantities. Before 
computing this table, we list the 4 (= 2 2 ) possible logical functions of one 
variable over D ( = {I, 2}), as follows: 


X 

«x) 

«x) 

Mx) 

U(x) 

1 

t 

t 

f 

f 

2 

t 

f 

t 

* f 


Here is the tabic for P(y) V Vx(P(x) D Q): 



P(x) 

Q 

y 

P(y) v Vx(P(x) D Q) 

1. 

l,(x) 

t 

i 

t 

2. 

l,(x) 

t 

2 

t 

3. 

It(x) 

f 

1 

t 

4. 

!,(x) 

f 

2 

t 

5. 

Mx) 

t 

1 

t 

6. 

E(x) 

t 

2 

t 

7. 

«x) 

f 

1 

t 

8. 

I'i(x) 

f 

2 

f 

9. 

Mx) 

t 

1 

t 

10. 

Mx) 

t 

2 

t 

11 . 

Mx) 

f 

1 

f 

12. 

Mx) 

f 

2 

t 

13. 

Mx) 

t 

1 

t 

14. 

Mx) 

t 

2 . 

t 

15. 

Mx) 

f 

1 

t 

16. 

Mx) 

f 

2 

t 


1,3 As will appear below (ij 34), we can talk about the number D even when D is 
infinite. 
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P(y)VVx(P(x)3Q) 

(i) I 2 (2) V Vx(I 2 (x) 3 f) 

(ii) f V f 

(iii) f 

The first step is to substitute the assignment represented by Line 8 into the 
formula to be computed; this gives (i). Toward (ii), we get f as the value 
of 1 2 (2) by the table for l 2 (x) above. But before we can evaluate the other 
part Vx(l 2 (x) 3 f) of (i), we need to compute l 2 (x) 3 f as a logical function 
of x; the result is shown in the supplementary table at the left below; and 
the computations of its two lines are at the right. 


X 

l/x)3f 

L(x) 3 f 

l*(x) 3 f 



L(l) 3 f 

W2) => f 

1 

f 

t =>f 

f =>f 

2 

t 

f 

t 

Continuing the main computation, 

since the supplementary table does not 


have all t’s, Vx(I 2 (x) 3 f) is evaluated as f; so we get (ii). Finally we get 
(iii) by the table for V. 

This illustrates the definition of the table for a formula E, for the given 
D. As before, shortcuts are possible. In our example, the observation that 
A 3 B is t whenever B is t shows that, whenever Q is t, P(x) 3 Q will have 
a supplementary table of all t’s, so by our prescription for evaluating VxA 
the value of Vx(P(x) 3 Q) will be t, so by the table for V the whole will 
be t. Thus we can write t in Lines I, 2, 5, 6, 9, 10, 13, 14 without further 


ado. li(i 

Example 2. Again with £> = {1,2}, we give several lines of the table 
for Vx(3xP(x) 3 P(x)) & P(x). 


P(x) x Vx(3xP(x) 3 P(x)) & P(x) 


1. 

lt(x) 

1 

t 

2. 

I.(x) 

2 

t 

3. 

l-j(x) 

1 

f 

8. 

f 4 (x) 

2 

f 


66 The method of truth-value analysis, illustrated by (3) in § 8, can be applied succes- 
sively to each proposition letter in E (i.e. in this example, just to Q). 
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Here 

: is the computation for the entry in 

Line 3. 


Vx(3xP(x) 3 P(x 

)) & P(x) 

(i) 

Vx(3xl,(x)3I,(x)) &l a (l) 

(ii) 

f 

t 

(iii) 


f 


In (i), we use the value I of x only at the free occurrence of x (the last one) 
in Vx(3\P(x) 3 P(x)) & P(x). To get (ii) we need a supplementary table 
for 3xL(.\) =3 !,(x) as a function of x; in setting this up we ignore the value I 
previously assigned to x for the whole formula Vx(3xP(x) 3 P(x)) & P(x). 
The supplementary table follows at the left, with the computations of its 
two lines at the right. 

x 3xl.,(x) 3 L(x) 3x!,(x) 3 l,(x) 3xl a (x) 3 l a (x) 

3xl a (x) 3 !.,( I ) 3xl 2 (x) 3 1,(2) 

t 3t t 3f 

t f 

For the two computations at the right, we need a further supplementary 
table, namely for l 2 (x) (ignoring the values already given to x for the 
respective lines of the supplementary table at the left which we are engaged 
in computing); however this is simply the table for l 2 (x) as given preceding 
(I) in Example 1. 

It should be clear from these examples (and the others below) that, 
provided a domain D has been selected and is finite, a table can be com- 
puted for any given formula E, at least injheory (i.e. 'disregarding practical 
limitations). Of course, for large finite D (or even for a small B, if E is 
complicated), if no shortcuts are used, the computation may be of imprac- 
tical length. If D is infinite, the table is no longer a finite object which in 
theory can be computed; but what is meant by the table should be clear 
enough (from the standpoint of classical mathematics), and we may be 
able to reason about it. When D may be infinite, we shall avoid the word 
"compute” and say instead “evaluate” or “determine”. 

When can a formula E be said to be true on the basis of only the predi- 
cate calculus? Considering that both the D (or D), and the logical func- 
tions over D as values of the ions in E (or truth values in the case of zero 
variables) and the members of D as values of the free variables of E, will 
be unavailable, the answer must be as follows: The formula E is true 
on the basis of the predicate calculus, exactly if, for each choice of D (or 
of the number B of its elements), the resulting truth table has only t’s in 
its value column. In this case we say E is valid ( in the predicate calculus) 
and write h E. (Tn this chapter, it will be understood that “valid” and “h” 
refer to the predicate calculus, unless the contrary is stated.) 

It is also often of interest to consider the predicate calculus supplemented 


1 t 

2 f 



DOMAINS, VALIDITY 


89 


§ 17 


by a choice of D (or of the number 5 of members of D) \ we then say E is 
valid in the domain D or is D-ualid, and write 5-1= E, exactly if the truth 
table of E for the chosen D has all t’s. Interesting cases are 5 = k (a 
positive integer), and D is the set of the natural numbers {0, 1, 2, . . 

There is a vast difference now from the situation we had in the proposi- 
tional calculus. There each question as to the validity of a formula E 
could (in theory) be settled mechanically by computing the truth table. 
Now the definition of validity refers to a whole infinite family of truth 
tables, one for each D, and for an infinite D we cannot (even theoretically) 
compute the table. For validity, every one of these tables should give all 
t’s. Despite this difficulty, we shall see that logical theory has gone quite 
far in solving problems of the predicate calculus. 

For a demonstration of invalidity, it suffices to find just one D and 
one line of the table for this D which gives f. Thus we already know that 
P(y) V Vx(P(x) D Q) is not valid, because we found an f in Line 8 of its 
table (1) for 5 = 2.- 

The notion of validity, e.g. as applied to the E of Example 1, arose by 
considering E as expressing a proposition with its free variable y naming 
some member of D. Since we were in ignorance, not only of D and the 
values of P(— ) and Q, but also ott h eunembe r .of 5. named by.y,_. we, could 
.know (using tb.e predicate calculus, and nothing else) fhat Eds , trite, when, 
and only when,, for. e very D , .thfi Ja.bte Joj^E^bnsJilLt’s.^ 

But formulas also serve as names for predicates, as we stressed in § 16. 
Indeed, we constructed the supplementary table for I 2 (x) D f from this 
point of view, and not because we were interested in whether P(x) Q 
expresses a true or false proposition. For the chosen D, and the assigned 
values of P(x) and Q, the supplementary table gives the logical function 
which then evaluates P(x) => Q interpreted as standing for a predicate 

Ofx.«7.64 

If we are interested in the whole formula E as expressing a predicate 

67 We can think of truth values and logical functions as X-ray pictures of propositions 

fjr/2 

and predicates. Thus “ I < 2”, “ sin * dx — 1 ”, “Socrates is a man” and “Madison 

Jo 

is an inland city” are four propositions, with quite different meanings; but under the 
X-ray the flesh becomes invisible, and only t shows. Likewise, “2 < 2”, 

>/2 

sin x dx = 2”, “Chiron is a man” and “New York is an inland city are four 


Jo 


other propositions, which appear under the X-ray as simply f. For D = { 1, 2}: “x < 2” 

»<r/2 

and “I sin x dx = x" when 1 and 2 are themselves, “x is a man” when 1 is Socrates 


Jo 


and 2 is Chiron, and “x is an inland city” when 1 is Madison and 2 is New York, are 
four different predicates (their respective values, listed above, are eight different 
propositions); but under the X-ray all four of them appear as the logical function l 2 (x). 
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rather than a proposition, the distribution of t's and f’s in its truth table 
will interest us, not just whether they are all t’s or not all t’s. This we can 
illustrate by going one step further to construct the formula VyE or the 
formula 3yL- from the formula E of Example I. For D = {1, 2}, VyE and 
ByE each have an 8-linc truth table, -since their tables arc entered from a 
value of P(x) and of Q, but not of y. The supplementary tables that we 
need in the next to the last step of computation appear as subtables of (1); 
indeed, the pairs of lines 1-2, 3-4, . . . , 15-16 give these supplementary 
tables. Thus inspection of (1) enables us at a glance to write the tables for 
VyE and ByE: 

P(x) Q Vy(P(y) V Vx(P(x) => Q)) 3y(P(y) V Vx(P(x) => Q)) 


1. 

h(x) 

t 

t 

t 

2. 

Il(x) 

f 

t 

t 

3. 

l,(x) 

t 

t 

t 

(2) 4. 

4(x) 

f 

f 

t 

5. 

4(x) 

t 

t 

t 

6. 

Wx) 

f 

f 

t 

7. 

h(x) 

t 

t 

t 

8. 

4(x) 

f 

t 

t 


From (2), we see that Vy(P(y) V Vx(P(x) D Q)) is not valid, but 
3y(P(y) V Vx(P(x) 3 Q)) is at least 2-valid. 

Example 3. As another illustration, we shall show that 
Vx3yP(x, y) 3 3yVxP(x, y) is not valid, by computing one suitable line of 
its table for the domain D = { 1,2). This table must be entered from a 
logical function as value of P(x, y). We first list the 16 (= 2*) possible such 
logical functions, as follows: 

x y (i(x, y) l 2 l 3 I 4 L, I 6 4 l* 4 Iio hi ha ha Li hs 4« 

11 t tttttttffffffff 

12 t tttffffttttffff 

2 1 t tffttffttff tiff 

2 2 t f t f t f t f t f t f t f t f 

Here is the truth table for our formula showing the entry for Line 10; the 

fact this entry is f proves the invalidity of the formula. 

P(x, y) Vx3yP(x, y) =3 3yVxP(x, y) 


10. l I() (x, y) 
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Here is the computation for the entry in Line 10 (explanation follows): 
Vx3yP(x, y) 3 3yVxP(x, y) 

(i) Vx3yl lu (x, y) 3 3yVxl lu (x, y) 

(ii) t 3 f 

(iii) f 

The lirst step is to substitute the assignment represented by Line 10 into 
the formula to be computed; this gives (i). 

Before we can evaluate the part Vx3yl )0 (x, y), we need to compute 
3yIio(x, y) as a logical function of x. We construct the following table (a), 
in which we tabulate this logical function (explanation follows). 

x 3yl 10 (x, y) 



.To compute the truth values in (a), we must in turn construct subsidiary 
tables as follows, (b) for the case in which the value of x is 1, and (c) for 
the case in which x is 2. 


y 

Mi.y) 

y 

lio(2, y) 

(b) j 

f 

(c) , 

t 

2 

t 

2 

f 


The values in (b) and (c) we of course obtain from our table of the possible 
logical functions of two variables over D. In (b) a t appears, so that by 
the evaluation rule for the existential quantifier 3y we get t in Line 1 of (a); 
and similarly in Line 2. 

In Table (a) we now have all t’s, so by the evaluation rule for V we get t 
to use as the value of Vx3yl 1() (x, y) in (ii). 

Proceeding similarly to evaluate 3yVxl J0 (x, y), we need to compute 
Vxl 10 (x, y) as a logical function of y. We construct the following table (a'), 
in which we tabulate the truth values of this function. 


y Vxl 10 (x, y) 



To determine the truth values in (a'), we must again construct two sub- 
sidiary tables (b') and (c')> 


x 

Iu»(x> 1) 


X 

Iio(X) 2) 

(b') j 

f 

(O 

1 

t 

2 

t 


2 

f 
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In each of (b') and (o') we fail to have a solid column of t’s, so by the rule 
for evaluating V the two values in (a') are f. Since we do not have any t 
in the value column of (a'), the rule for evaluating 3 gives us f as the 
value of 3yVx! 10 (x, y) to use in (ii). 

Thus wc reach the situation depicted in (ii) of the computation of the 
entry in Line 10, and can proceed to (iii) to get the value f in Line 10 for 
the whole formula. 

Example 4. We shall now show the formula P(y)D3xP(x) to be 
valid. In doing so, we cannot help using some general reasoning, as we 
must show that we get all f’s in the table for any D. However, to help 
ourselves picture the situation, we begin by taking D =» {1, 2, 3}. The 1- 
place logical functions are now 8 (= 2 3 ) in number, as follows: 

x li(x) l 2 (x) l 3 (x) l 4 (x) I 6 (x) l„(x) l 7 (x) l 8 (x) 


1 t t t t f f f f 

2 t t f f t t f f 

3 t f t f t f t f 

The table for P(y) D 3xP(x) will have 24 ( = 8 • 3) lines, since all 8 logical 
functions have to be listed under P(x), each with each of the 3 members 
of D as y. We show two lines as a sample. 


P(x) y P(y) D 3xP(x) 


14. 

fr,(x) 2 

t 

22. 

l 8 (x) 1 

t 


For Line 14, note that I 5 (x) has a 1 in its table, e.g. for x = 2. Hence by 
the rule for evaluating 3x, the part 3xP(x) takes the value t; so by the 
table for D, the whole is t. This consideration suffices for the first 21 lines 
of the table, in each of which the l(x) has a t in its table; i.e. it suffices for 
every t(x) except l 8 (x). For Line 22 on the other hand, 1 8 (1) is f; so by the 
table for D, the whole is t. This consideration suffices for the last three 
lines, in which, since I 8 (x) has only f’s in its table, I 8 (y) and thus P(y) will 
be f whatever y is. It should be clear now that for any D, even an infinite 
one, the table for P(y) D 3xP(x) will have all t’s. The demonstration is 
given by classifying the assignments to P(x), y (or the “lines”). First, con- 
sider any assignment with an I(x) as value of P(x) other than the logical 
function with all f’s; then 3xP(x) is t, so the whole is t. Second, consider 
any line with the l(x) whose table is all f’s as value of P(x); then P(y) is f 
whatever the value assigned to y, so again the whole is t. 
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The same reasoning shows that P(x) 3 3xP(x) is valid; for any assign- 
ment, the value assigned to x for the whole formula P(x) 3 3xP(x) is 
ignored in evaluating 3xP(x) from the logical function assigned as value 
to P( — ). (Cf. Example 2.) 

Altogether, we can thus say that ¥ P(r) 3 3xP(x), for any variables x 
and r, where r is not necessarily distinct from x. 

Exercises. 17.1. How many lines are there in the truth table for 
D = { 1, 2}? Compute in full detail the line indicated. 

(a) Vz(P(x) 3 iQ V P(z)) when P(x), Q, x are I 3 (x), t, 2. 

(b) P(x, y) 3 Vx(P(x, y) 3 3xP(x, x)) when P(x, y), x, y are l 14 (x, y), 2, 1. 

17.2. Show that Vx3yP(x, y) 3 3yVxP(x, y) is I -valid, by computing in 
full its table for D = {1}. (By Example 3, it is not 2- valid.) 

17.3. Show that each of the following formulas is invalid. 

(a) -i[Vx3yP(x, y) 3 3yVxP(x, y)]. (b) 3x3yP(x, y) 3 3xP(x, x). 

(c) 3xP(x) & 3xQ(x) 3 3x(P(x) & Q(x)). 

17.4. Give a demonstration by cases (classifying the assignments) that 
VxP(x) 3 P(y) (and VxP(x) 3 P(x)) is valid. 

17.5. Is the formula valid? (Show why or why not.) 

(a) P(x) 3 VxP(x). (b) 3xP(x) 3 P(x). 

(c) VxP(x) 3 3xP(x). (d) 3xP(x) 3 VxP(x). 

(e) 3y(P(y) V Vx(P(x) 3 Q)). (Cf. the right column in (2).) 

17.6. Show that, for any variable x and formula A: ¥ A if and only if 
¥ VxA. Similarly, with more variables. 

17.7*. Find (a) a formula which is I -valid and 2-valid but not 3-valid, 
and (b) a formula which is I-, 2- and 3-valid but not 4-valid. 


§ 18. Model theory: basic results on validity. It is a little fussy 
to extend Theorem 1 § 3 to the predicate calculus, so we postpone that to 
§ 19 (Theorem 17). However, in the special case that the formula E into 
which we substitute is a formula of the propositional calculus, the reasoning 
we used in §2 suffices. Thus: Theorem 1 holds (with “t=” referring to the 
predicate calculus) when E is any formula of the propositional calculus 
containing only the atoms (i.e. 0-place ions) P lt . . . , P„ but A„ . . . , A„ 
are any formulas of the predicate calculus. Consequently: Theorem 2 
holds when A, B, C are any formulas of the predicate calculus. Also by 
the same reasoning as before: Theorem 3 holds when A, B are any 
formulas of the predicate calculus. We defer the extensions of Theorems 
4-7a to § 19. 

In the next theorem (Theorem 1 5), we generalize the results of Example 4 
and Exercise 17.4. This will come under the extension of Theorem 1 to the 
predicate calculus (Theorem 17), but again the special case in question is 
simpler. 
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To generalize the result that N P(r) 3 3xP(x) (Example 4), we shall 
replace the two atoms P(x) and P(r) by any two formulas suitttbly related 
to each other, i.e. so related that the reasoning in Example 4 will still apply. 
Toward this end, consider any formula whatsoever, which we shall call 
•'A(x)" rather than simply “A”. Now we denote by “A(r)” the result of 
substituting r for the free occurrences of x in A(x). For example, if A(x) 
is V/(Q(x) V VxP(x, y) V P(z, x)) and r is y, then A(r) is 
Vz.(Q(y) V VxP(x, y) V P(z, y)). With this notation, our proposed general- 
ization of “f P(r) 3 3xP(x)” will read “t= A(r) 3 3xA(x)”, as though it 
simply consists in changing “P” to “A”; but we must not forget what is 
behind the notation. 

Will the reasoning which in Example 4 established that 1= P(r) 3 3xP(x) 
now give that f A(r)3 3xA(x)? It will if A(r) differs from A(x) exactly by 
A(r) liar iny a free occurrence of r in each position where A(x) has a free 
occurrence of x. for then, whatever domain D we consider and whatever 
assignment (or line in the table) for that D , the value' of A(r) will be 
among the values in the supplementary table for A(x) used in evaluating 
3xA(x); namely, the value of A(r) will be the same as the value of A(x) 
when x has the value of r. This is what was essential to our reasoning in 
Example 4. (A concrete example will follow.) 

By the way we obtained A(r) from A(x), A(r) differs from A(x) exactly 
by A(r) having an occurrence of r wherever A(x) has a free occurrence 
of x. But are these occurrences of r in A(r) all free? It depends on what 
variables and formula x, r and A(x) are. If these occurrences (i.e. the 
occurrences of r in A(r) resulting from the substitution of r for the free 
occurrences Of x in A(x)) are all free, we say that r is free for x in A(x) or 
that the substitution of r for x in A(x) (with result A(r)) is free. In this 
case, as we have said, the former reasoning carries over and establishes 
that t= A(r) 3 3xA(x). 

Thus we obtain (b) of the theorem. Similarly, (a) generalizes Exercise 
17.4, first, we repeat the key notational convention and definition. When- 
ever vve introduce a notation like “A(x)” for a formula showing a variable 
x in the notation, vve shall thereafter understand by “A(r)” for any variable 
r the result of substituting r for the free occurrences of x in A(x). (It is not 
required that the formula denoted by “A(x)” actually contain x free; if 
A(x) does not contain x free, then A(r) is simply A(x) itself. Also it is not 
excluded that A(x) may contain free other variables than x.) We call r free 
for x in A(x), or say the substitution is free, if the resulting occurrences of r 
in A(r) are free. 

TiiLORrM 15. Let x be any variable , A(x) be any formula, r be any 
variable not necessarily distinct from x, and A(r) be the result of substituting 
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r for the free occurrences of x in A(x). If r is free for x in A(x), then: 

(a) 1= VxA(x) D A(r). (b) f A(r) D 3xA(x). 

The proof has already been indicated; but we shall illustrate it in an 
example. Also we shall show how the reasoning (and also the conclusion) 
fails in another example that does not satisfy the final hypothesis. 

Example 5. Let A(x) be Vz(Q(x) V VxP(x, y) V P(z, x)) and r be y. 
Then A(r) is Vz(Q(y) V VxP(x, y) V P(z, y)), where exactly the first and 
third occurrences of y result from the substitution for the free occurrences 
of x in A(x). These occurrences of y are both free. Thus r is free for x in 
A(x). So the theorem applies, and tells us that A(r) Z) 3xA(x), i.e. 

Vz(Q(y) V VxP(x, y) V P(z, y)) Z) 3xVz(Q(x) V VxP(x, y) V P(z, x)), 

is valid. To illustrate how the proof of the theorem applies to this case, 
consider for example the domain D = {1,2} and the assignment of I 4 (x), 
l 7 (x, y), 2 to Q(x), P(x, y), y respectively (cf. Examples 1 and 3). We get 
the two entries in the supplementary table for A(x) entered from x (as 
required to evaluate 3 x A fx)ytjye v a 1 u a tfh g the two expressions 
A(l): Vz(I 4 (l) V Vxl,(x, 2) V l 7 (z, 1)), 

A(2): Vz(t 4 (2) V Vxl 7 (x, 2) V l 7 (z, 2)), 

while the value of A(r) is that of the expression 
A(r) : Vz(I 4 (2) V Vxl 7 (x, 2) V l 7 (z, 2)) ; 

the latter is the second of the two expressions to be evaluated for the 
supplementary table. In fact, the supplementary table has only f’s (as in the 
second case in Example 4); and since the value of A(r) is one of the values 
in the supplementary table (the second in fact), it is f, so A(r) Z) 3xA(x) 
is t. If we change the example to use l 6 (x, y) instead of I 7 (x, y), then 
the supplementary table has a t (in the first line, for A( 1 )) ; so 3xA(x) 
is t, and A(r) Z) 3xA(x) is t anyway. Similarly whatever the domain D 
and the assignment, A(r) Z) 3x A(x) will be t in one of the two ways illustrated. 

Example 6. Let A(x) be as in Example 5, but let r be z. Then A(r) is 
Vz(Q(z) V VxP(x, y) V P(z, z)). Exactly the second and fourth occurrences 
of z result from the substitution; and these are not both free (in fact, 
neither is). So the substitution is not free, and the theorem does not apply. 
To see how the reasoning which establishes the theorem fails to apply 
here, consider for example the domain D = {1,2} and the assignment 
l 4 (x), ( 7 (x, y), 2. The values in the supplementary table for A(x) are those 
of A(l) and A(2) as in Example 5, but the value of A(r) is that of 
A(r): Vz(( 4 (z) V Vx! : (x, 2) V l 7 (z, z)). 

In this example, the latter expression is not one of the former two. In fact, 
the latter is t, while as before both the former are f and hence 3xA(x) is f ; 
so A(r) Z> 3xA(x) is f. Therefore A(r) Z) 3xA(x) is not valid. — 
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Fn the next theorem, we again denote a formula by a notation “A(x)” 
showing a variable x. (The formula denoted by “A(x)” is not required to 
contain x free; it may contain other variables free.) In this case, we do so 
to contrast A(x) with another formula, denoted by “C”, which shall not 
contain x free. (Usually, when we simultaneously denote some formulas 
by notations showing a variable x and others by notations not showing x, 
this is to help us remember that the latter formulas are not to contain x 
free, while the former are allowed but not required to contain x free.) 

Theorem 16. Let x be any variable, A(x) be any formula, and C be any 
formula not containing any free occurrence of x. Then: 

(a) lfY C 3 A(x), (b) lfV A(x) 3 C, 

then t= C 3 VxA(x). then 1= 3xA(x) 3 C. 

Prooe. (a) Suppose 1C3 A(x). We must show that 1C3 VxA(x). 
Choose any domain D. For this D, consider any assignment of logical 
functions and members of D to exactly the ions and free variables of 
C 3 VxA(x) (i.e. consider any line of the table entered from exactly these); 
call this the "given assignment”. Since x does not occur free in C, this does 
not include an assignment of a member of D to x. Case 1 : for the given 
assignment, C is f. Then, by the table for 3, CD VxA(x) is t. Case 2: 
for the given assignment, C is t. Then, for the given assignment supple- 
mented by any assignment to x, C is still t (cf. the discussion preceding 
Theorem 3), so, since C 3 A(x) is t (by the hypothesis l=C3 A(x)), A(x) 
is t (by the table for 3). As this was for any assignment to x together with 
the given assignment, VxA(x) is t for the given assignment, by the rule for 
evaluating V. Hence, by the table for 3, C 3 VxA(x) is t for the given 
assignment, (b) By a similar argument by cases (Exercise 18,2). 

Exercises. 18.1. Does Theorem 15 justify concluding that the formula 
is valid? (If not say why Theorem 15 does not apply; and try to settle by 
other methods whether the formula is valid or not.) 

(a) Vx3yP(x, y) 3 3yP(y, y). (d) P(x, x) 3 3yP(x, y). 

(b) Vx3zP(x, z) 3 3zP(y, z). (e) P(x, x) 3 3yP(y, y). 

(c) 3yP(y, y) 3 3x3yP(x, y). (f) VyP(x, y) 3 P(y, y). 

18.2. Prove Theorem 16(b). 

18.3. Show that Theorem 16 would not hold in general without the 
restriction that the C not contain the x free. (Hint: try P(x) as both A(x) 
and C.) 

*§ 19. Model theory: further results on validity. 68 We shall now 

examine under what conditions the reasoning which gave us Theorem 1 

6S Some of the results in this section will not be used later, and the others will be 
obtained independently in proof theory. 
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in § 3 will hold good in the predicate calculus. (The student should review 
the proof of Theorem I.) 

First, we consider the mechanics of substituting for an ion. Since an 
ion, e.g. P( — ), stands for a propositional function, the process of substitu- 
tion for P( — ) is similar to that for a function variable in mathematics 
rather than simply for a number variable. 

For example, consider the statement 

(a) Vz[/(-z) + /(*) = 2/(0)], 

which is true for some functions / and false for others. (So long as we are 
concerned only with the mechanics of substitution, the truth or falsity of 
the formulas is beside the point.) In analyzing substitutions into (a), it will 
be convenient to pick a name form, say “/(if)” for the function / Now if 
we substitute “cos n>” or “»r* — if” for “/(if)” in (a), we obtain respec- 
tively: 

(b) Vx[cos — x + cos x = 2 cos 0], 

(c) V4((-z) 3 - ( -a )) + (* 3 - *) = 2(0 3 - 0)]. 

Notice that in performing these substitutions, the arguments x'\ “*” 
and “0” of “/” in (a) get substituted for the name form variable “tr” in 
“cos h ” or “if* — if”. A simpler alternative analysis is available in the case 
of (b); namely, we can regard the substitution as simply of “cos” for “/”. 
In the case of (c), this alternative is not available, because our only perm- 
anent name for the function to be substituted is “if 3 — if”, which has 
“if” in it. Of course, we could introduce a name for ir 3 — if without “if” 
in the name, say “g”, by putting g(ir) = if 3 — if. Then, simply substituting 
“g” for “/” in (a), we get: 

(c') Vx[g(-*) + g(«) = 2 g(0)]. 

But we don’t want to tie up “g” permanently as a name for the function 
if 3 — if; so we have to evaluate “g(— x)'\ “g(x)” and “g(0)” in (c'). This 
brings us back to (c) with “— x", “x" and “0" substituted for the “if” of 

“if 3 — if”. 

The second example is like those we will usually have in the predicate 
calculus, and we deal with them in the same manner. For illustration, say 
we are to substitute for P(w) in P(y) D 3xP(x). (We are slightly altering 
the example preceding Theorem 15 to fit our present aims better.) We 
shall substitute a formula ordinarily (but not necessarily) containing w 
free; and for this formula we adopt “A(w)" as a temporary notation. Then 
the result of the substitution can be written A(y) Z) 3xA(x) (simply 
changing “P” to “A”); but in getting rid of the temporary notation 
“A(w)’\ the arguments y and x of P in P(y) O 3xP(x) are substituted for 
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the free occurrences of w in the formula abbreviated by “A(w)“. Here we 
are using the notational convention introduced just before Theorem 15, 
but with w in the role of the x there, and y and x successively as the r there. 

Having now described the mechanics of substituting for P(w) in 
P(y) 3 3xP(x), we show the result for five choices of A(w): 



A(w) 

A(y) 3 3xA(x) 

I. 

VzQ(w, z, w) 

VzQ(y, z, y) 3 3xVzQ(x, z, x) 

II. 

VyQ(w, y, w) 

VyQ(y, y, y) 3 3xVyQ(x, y, x) 

III. 

Q(w, u, w) 

Q(y> u, y) 3 3xQ(x, u, x) 

IV. 

Q(w, x, w) 

Q(y, x, y) 3 3xQ(x, x, x) 

V. 

VwP(w) V Q(w) 

VwP(w) V Q(y) 3 3x(VwP(w) V Q(x)) 


Of these five substitutions, we shall regard only I, 111 and V as “free”. 
In II and IV the evaluation of a line for the truth table for A(y) 3 3xA(x) 
(with a given domain D) will not always split into two parts, the first being 
the determination of a logical function (described by a supplementary 
table entered from w) as value of A(w), and the second coinciding with the 
evaluation of P(y) 3 3xP(x) from that logical function as value of P(w). 
Such a split is necessary to the reasoning to generalize Theorem 1. Briefly 
stated, the trouble in II is that the free y in P(y) becomes bound by the Vy 
of the A(vv), and in IV that the free x in the A(w) becomes bound by the 
3x in 3xP(x). We shall say this more fully. 

In the examples I-V of A(y) 3 3xA(x), the parts which originate from 
P(y) 3 3xP(x) have been underlined, leaving nonunderlined the parts 
which originate from the respective A(w)’s. Thus in II, 'the 2nd and 4th y’s 
originate from the y of P(y)3 3xP(x), while the 1st, 3rd, 5th and 6th 
originate from the y’s of the A(w), i.e. of VyQ(w, y, w). The trouble in II 
is that the first Vy (originating from the A(w)) binds not only the 1st and 
3rd y’s (as it should) but also the 2nd and 4th. Similarly in IV, the 3x 
(originating from P(y) 3 3xP(x)) binds not only the 2nd, 3rd and 5th x’s 
(as it should) but also the 4th (originating from the A(w)). 

For a “free” substitution we wish to avoid such mix-ups in the way the 
variables are bound after the substitution. So we say a substitution is free , 
if in the resulting formula (A) no nonunderlined quantifier binds an 
underlined variable, and (B) no underlined quantifier binds a non- 
underlined variable. 

Now we describe the mechanics of substitution for ions in the general 
case. Say E is a formula which contains only the distinct ions having the 
respective name forms 

(1) P,(w„ . . . , w Pi ), .... P^Wj, . . . , w„ n ) 
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(where n > I and p lt . . . ,p„ > 0), i.e. each prime component of E is of 
the form P^h, . . . , r„ ) where 1 < i < n and r„ . . . , r Jlj is a list of 
variables not necessarily distinct from each other or from the variables 
w,, w 2 > w 3 . . .. The substitution of formulas 


(2) A,(w 1( . . . , w Pi ), . . . , Aj/wj, . . . , w„J 

for (1) in E with result E* is effected by replacing simultaneously each 
prime component P,(ri, . . . , r„ ) of E by A,(r 0 . . . , r,,), where (extending 
the notation introduced in § 18) A ; (r lt .... r Pj ) is the result of substituting 
simultaneously r t , . . . , r„, for the free occurrences (if any) of w It . . . , w p . 
respectively in A,(w 1 , . . . , w } ,.). 

The rule for underlining, in terms of which we have expressed our 
definition of free, is the following: First, underline all of E outside the 
parts P^r^ . . . , r„.) to be replaced in passing to E*. Second, in the 
replacements A,(rj, . . . , r„.) for those parts, underline those occurrences 


w 





of r^ . . . , r„ which take the places of the free occurrences of w,, . . . , 

in A,(w l5 . . . , w„.) (i.e. the occurrences of ri r Pj in A(r 1; . . . , r„ ) 

resulting from the substitution for w lt . . . , w Pi in A(w 1( . . . , w p .)). 69 

When E* comes from E by a free substitution, the evaluation of any 
ine of the truth table for E* (for a given D) can be analyzed into, first, the 
determination of logical functions 

Ii(w 1( . . . , w P[ ), . . . , l„(w 1 w Pi ) 


as values of (2), and second, further steps corresponding to the underlined 
operators in E* (the underlined 3x and 23 in our illustrations). These 
further steps coincide with the evaluation of E from (3) as values of (1) 
and the same values of the free variables of E as in that line for E*. So, if 
h E, then 1= E*. Thus: 

Theorem 17. (Substitution for ions, extending Theorem 1.) Let 
E* come from E by substituting (2) for (1) as described above. If this 
substitution is free, then: If i E, then f E*. / 

Considering only the part of the preceding discussion which relates to 
A(w lt . . . , wj and A(r 1; . . . , r„) (taking n = 1 and omitting “/”), we 
have the next theorem. Indeed, under the stated condition, any line in the 
table for A(r 1( . . . , r„) receives the value we obtain by first determining 
a logical function l(w 1; . . . , w„) as value of A(Wj, . . . , w p ), and then using 


Iu testing for freedom in the substitution of (2) for (1), we only need to underline 
variable occurrences. (The rest of the underlining we did was to cite in the proof below.) 

Of course, the definition of “free substitution” can be stated without recourse to 
underlining. Cf. IM p. 156 with p. 79. 
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the value of this function when w 1; . . . , w j; have the values assigned to 
r i> • • • > V 


Theorem 18. (Substitution for individual variables.) 63 Let w l5 . . . , w„ 
1 be any ( distinct ) variables, A(w x , . . . , w„) be any formula, r,, . . . , r„ be any 
I variables not necessarily distinct from each other or from the variables 
j w 1; . . . , w p , and A(r 1; . . . , r„) be the result of substituting r 1; . . . , r fl 
simultaneously for the free occurrences of Wj, . . w,, respectively in 

A(w„ . . . , wj. If this substitution is free (i.e. the occurrences of r 1; . . . , r p 
in A(r x , . . . , r (J ) resulting from it are free), then: lfV A(w 1; . . . , w t ), then 
f A(r lt . . . , r„). 


We resume our quest for results in the validity theory of the propo- 
sitional calculus (through § 6) which we can take over for the predicate 
calculus. 


' Theorem 19. (Theorem 4 adapted to the predicate calculus.) (a ) For 
each domain D and assignment, A ~ B is t if and only if A and B have 
the same truth value. Hence: (b)° t= A ~ B if and only if, for each domain 
D, A and B have the same truth table. 


Theorem 5 and Corollary, and (a)-(£) in § 5 (and thence the chain 
method) extend to the predicate calculus, no change being necessary in 
their wordings. (Also, for each particular D, these results hold reading 
“Z>- 1=” in place of “(=”.) 

Two formulas^which are congruent (end § 16) have the same truth 
tabl es, for any given domain D. For, the alphabetical difference’3~-in-the 
bound variables will make no difference in the process of determining the 
tables. Hence, by Theorem 19(b): 

Theorem 20. If A is congruent to B, I A — > B. 

1 1 We could now give a list of valid forms of formulas, similar to Theorem 2 
1 for the propositional calculus. However we shall wait till we are ready to 
/ establish t he result s in proof theory (Theorem 26 § # 25). We give just two 
I of them now, fori h e* p u rpdsIT oFex te n d i n g Theorems 6 and 7. 

\ *82a. f “ )3xA(x) ~ Vx~ iA(x). *82b°. f ~i iVxA(x) ~ 3X - iA(x). 


Proofs. *82a. For any D, and any assignment to the ions and free 
variables of ~i i3xA(x) ~ Vx~iA(x), consider the supplementary table for 
A(x) as a logical function of x. Case 1 : this table has a t in some line. 
Then 3xA(x) receives the value t, and — i3xA(x) the value f. The supple- 
mentary table for ~iA(x) then has an f in some line (each one where A(x) 
has t); so VxmA(x) also has the value f. So by the table for ~ (or 
Theorem 19(a)), ~i3xA(x) ~ Vx~iA(x) is t. Case 2: the supplementary 
table for A(x) has only f’s. Then similarly, - i3xA(x) and Vx~ iA(x) both 
have the value t, and hence again — i3xA(x) ~ Vx - iA(x) is t. 
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*825. Using the chain method: 1= 3x~iA(x) ~ -i-i3x“iA(x) [*49] ~ 
-iVx-i - iA(x) [*82a] ~ -iVxA(x) [*49]. 

Theorems 6 (and Corollary), 7, 6a and 7a all extend now to the 
predicate calculus, when the operations * and ' are extended to include the 
interchange of V with 3. To establish Theorem 6 thus extended, *82a 
and *82b take care of moving a ~i rightward across a quantifier. For 
Theorem 7, the substitution rule is again available (as Theorem 17), and 
indeed the substitution simultaneously of — iP 4 (w lt . . . , w„) for each 
ion P J (w 1 , .... w p ) (i — 1 ,...,«) is free (quite trivially). For Theorems 
6a and 7a, we observe that the evaluation rule for V when written in terms 
of the Martian’s T, F reads just like ours for 3 in terms of t, f; and vice 
versa. In the compilation of results to be given later (Theorem 26), it will 
be possible to recognize many pairs in which (with E) one entails the other 
by duality (the extended Theorems 7, 7a). 

Exercises. 19.1. Perform each of the following substitutions. For 
each, say whether the substitution is free or not; if not free, say why. 

(a) 3zP(z, w, y), Q for P(w), Q in P(z) 3 Q. 

(b) 3xP(x, w, y), Q(w) for P(w), Q(w) in Vy(P(z) 3 Q(y)). 

(c) 3zP(z, w, y), Q(w) for P(w), Q(w) in Vx(P(x) 3 Q(x)). 

(d) P(w, v, x), Q for P(v, w), Q(w) in Vx(P(x, y) V Q(x) 3 P(y, x)). 

(e) 3zQ(z, w, w, y), VxP(v, w, x) for P(w), Q(v, w) in Vx(P(x) 3 Q(x, x)). 

(f) 3zQ(z, w, w, y), VxP(v, w, x) for P(w), Q(v, w) in VxP(x) 3 VyQ(y, y). 

(g) 3zVwP(z, w, y), Q(z, w) &3wR(w) for P(w), Q(w) in 3zP(z) 3 Q(z). 

19.2. Analyze Example 5 as an application of Theorem 17 to 

E P(r) 3 3xP(x), and show why Theorem 17 does not apply in Example 6. 

19.3. (a) Give an argument (by classifying the assignments) that 
E P & 3xQ(x) ~3x(P & Q(x)). (b) Under what conditions can you 
thence infer that t> A & 3xB(x) ~ 3x(A & B(x)) where A and B(x) are 
formulas? (c) Show that E A &3xB(x)~3x(A & B(x)) does not hold for 
all choices of formulas A and B(x). 

19.4. Find an equivalent formula containing ~i only applied directly 
to atoms. 

(a) ~ iVx{(P(x) V 3y~iQ(x, y)) & VyR(y)} 

(b) — i{-i(3xP(x) 3 VxQ(x, y)) V VxiP(x)}. 


§ 20. Model theory: valid consequence. In §7, we defined “valid 
consequence” in the propositional calculus. Adapting this definition to 
the predicate calculus in the obvious way, we say that (for m > 1) B is a 
valid consequence of A 1? . . . , A,„ (in, or by, the predicate calculus), or in 
symbols A lt . . . , A m E B, under exactly the following circumstances: 
For each domain D, in truth tables entered from a list including all the 
ions and free variables of A^ . . . , A m , B, the formula B is t in all those 
lines in which A s A m are simultaneously t. As before, it is immaterial 
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which such list of ions and variables is used. (Vv'e define “A It .... A m 
7)4 B” similarly, using a domain D with D elements; cf. § 17.) 

Example 7. Which of the four formulas ~iP(x), ~iP(y), Vx - iP(x), 
“iVxP(x) are valid consequences of ~iP{x); or in symbols, which of the 
four statements iP(x) t= ~ iP(x)”, iP(x) t — iP(y)”, i iP(x) I- Vx - iP(x)”, 

“~iP(x) t= ~ iVxP(x)” hold? We must compare the truth tables for the 
four formulas with the truth table for the assumption formula ~ iP(x) 
(which is the first formula). We begin by constructing the truth tables with 
D = { 1,2), using I,, L, 1 3 , 1 4 as before, namely: 


X 

Ir(x) 

L(x) 

Wx) 

t 4 (x) 

1 

t 

t 

f 

f- 

2 

t 

f 

t 

f 


For convenience, we enter all four tables (a)-(d) from P(x), x, y, although 
y is necessary only in investigating ‘ iP(x) f ~iP(y)”. 



P(x) 

X 

y 

(a) 

~iP(x) 

(b) 

-sP(y) 

(c) 

Vx-iP(x) 

(d) 

~iVxP(x) 

1. 

Ii(x) 

1 

1 

f 

f 

f 

f 

2. 

b(x) 

1 

2 

f 

f 

f 

f 

3. 

ll(x) 

2 

1 

f 

f 

f 

f 

4. 

ll(x) 

2 

2 

f 

f 

i 

f 

5. 

L(x) 

1 

1 

f 

f 

f 

t 

6. 

l.(x) 

1 

2 

f 

t 

f 

t 

7. 

L(x) 

2 

1 

t 

f 

f 

t 

8. 

L(x) 

2 

2 

t 

f 

f 

t 

9. 

l 3 (x) 

1 

1 

t 

t 

f 

t 

10. 

l 3 (x) 

1 

2 

t 

f 

f 

t 

11. 

l;i(x) 

2 

1 

f 

t 

f 

t 

12. 

I*(x) 

2 

2 

f 

f 

f 

t 

13. 

l 4 (x) 

1 

1 

t 

t 

t 

t 

14. 

l 4 (x) 

1 

2 

t 

t 

t 

t 

15. 

l 4 (x) 

2 

1 

t 

t 

,t 

t 

16. 

I 4 (x) 

2 

2 

t 

t 

t 

t 


We must consider those lines in which (a) has t, namely Lines 7, 8, 9, 10, 
13, 14, 15, 16. In each of these lines (a) and (d) have t. Thus 

i iP(x) h ~iP(x)” and i iP(x) 1= — tVxP(x)” both hold so far as the test 
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by D — {I, 2} goes. To prove that they hold (without restriction to 
D = 2), we need to reason that we would get the same result no matter 
what (nonempty) D is used. We leave this to the student (Exercise 20.1). 

In Line 7, (b) has f and so does (c). Thus (without having to consider other 
D's), we find that iP(x) t= — iP(y)” and iP(x) NVx - iP(x)” do not hold. 

As before (§ 7), “ A N B” is stronger than “If f A. then N B”. 

With the definition of “valid consequence” just given, Theorem 8 and 
Corollary extend to the predicate calculus. The proofs are essentially as 
before. 

Now we must notice that the foregoing definition of “valid consequence” 
(call it “(I)”) is not the only one that arises in the predicate calculus 
(and in mathematics). 

In that definition, we treated the ions and the free variables of A u . . . , 

A,„ both in the same manner as in § 7 we treated the atoms (now 0-place 
ions). Thus, we considered each ion as standing for a predicate, and each 
variable in its free occurrences as standing for a member of D. That 
predicate, or member of D, is to be the same throughout any discussion in / 
the predicate calculus, though what it is, is to be unknown to us or not / 1 
to be taken into account in the predicate calculus. So we entered tables j 
from all the ions and free variables of Aj, . . . , A,„, B in testing whether '/ 

B is t whenever A,, . . . , A m are all t. 

In the alternative definition of “valid consequence” which we shall 
presently give (call it “(II)”), we shall treat the free variables or some of 
them differently. Specifically, we shall not require the member of D which 
a free variable names to be the same throughout the discussion encompass- 
ing both A lt . . . , A,„ and B, but shall allow it to be different in different 
formulas or different applications of the same formula. 

To see how these two possible treatments of the free variables arise in 
practice, we consider some examples in the language of informal mathe- 
matics. The story is really a familiar one to students of elementary algebra 
from the difference between (I) a conditional equation and (II) an identical 
equation. Examples of conditional equations are (1) x- — 2x — 3 = 0 
and (2) y — x + 1 ; of identical equations, (3) x + y = y + x and 
(4) ( x + l) 2 = x 2 + 2.r + I. From (I) we should not infer that 
2 2 — 2 • 2 — 3 = 0 or y 2 — 2y — 3 = 0, though we can infer 
(x — 3)(.r + 1) = 0, thence x — 3 = 0 V a; + 1 =0, and thence 
x = 3 V x = — 1. From (3) we can infer 3 + 1 = 1 + 3 and 
(a: + z) + 2x = 2x + (x + z). 

We say that in (1) and (2) the. v ariables have the conditional inte r- 
pretati on (1), in (3) and (4) th e g enerali ty interpretation (II). Under the 
conditional interpretation of x in an assumption formula A(x) containing 
x free, any conclusions we draw containing x free should refer to the same 
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member of D as in the assumption A(x), which expresses a “condition 
on x"; so the conclusion applies to any member of D satisfying the 
condition. Under the generality interpretation, we are justified in inferring 
whatever follows from A(x) being true for all x or true “identically” or 
true “in general”. In the inferences from (1) we can say the variable “at” is 
“held constant” since it stands for the same number throughout the 
deductions. In the inferences from (3), “x" and “y” are “allowed to 
vary”, since their values may be changed. 70 

There would not be any problem of distinguishing these two inter- 
pretations, if only bound variables were used. But the use of free variables 
is very convenient, as is shown by their frequence in mathematical 
writings. 

The result of the above inferences from (1) can be written with x 
bound, after discharging the assumption (1), thus: 

(a) Vx(x 2 -2i-3 = 03x = 3Vx = — 1). 

In (3) we could have used bound variables in the assumption itself (thus : 
\/x Vy(x + y — y +x)) ; and the first result can be written, after discharging 
the assumption, thus: 

(b) V*Vy(* + y = y + x) D 3 + 1 « 1 + 3. 

Note that in (a) the parentheses close after the 3, in (b) before the 3. 

Which interpretation (I) or (II) to use is for us to choose in each case we 
propose to infer consequences from assumptions, depending on the role 
we intend the assumptions to have. The choice can be made independently 
for each free variable of each assumption. 

An inappropriate choice will not result in error, unless we subsequently 

,0 In mathematics, letters are often classified as “constants" and "variables”. Close 
inspection shows that the distinction in use is relative to a context (whatever the term- 
inology; cf. 1M p. 150). A given letter is used as name for some object, and throughout 
some context every occurrence of it is as a name for the same object. From outside this 
context, it is indicated that the object may be any one (some one, etc.) of the members 
of some collection or set D. Thus the letter is “held constant” within the context, while 
from outside it may be “varied”. 

It is important to know exactly the duration of the context, irrespective of how often 
the letter occurs in it; e.g. Vx(A(x)3B) and VxA(x) 3 B have different meanings 
even when x does not occur free in B. As we have illustrated, in our logical formulas 
and likewise in formulas as written in mathematics, when a variable appears free, it 
names any member of D, but there are two possibilities as to the duration of the context; 
(I) the context is an extended discussion, specifically here an entire deduction, (II) the 
context is just the whole formula. 

The universal quantifier Vx is necessary in logic as a device by which the context 
throughout which a variable x names the same arbitrary member of D can be made less 
than the whole formula. 
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misstate what we have done. For example, using the generality inter- 
pretation of a; in (1) (which is inappropriate), we can infer 
2 2 — 2 • 2 — 3 = 0, and thus establish 

(c) Vx(x- - 2* - 3=0) 3 2 2 - 2- 2- 3 = 0, 

which is a correct but uninteresting result. Only if we claim that 
22 — 2- 2- 3 = 0 follows from (1) under the conditional interpretation, 
and thus claim to establish 

(d) VzOc 2 - 2x - 3 = 0 3 2 2 - 2 • 2 - 3 = 0), 

do we make an error. From (d) by specializing to x = 3 and simplifying, 
0 = 03—3 = 0, and thence —3 = 0. 

The valid consequence relationship (I) formulated at the beginning of 
this section and illustrated by Example 7 corresponds to using the 
conditional interpretation of all the free variables of the assumption 
formulas A x , . . . , A m . The alternative one (II) to be formulated next 
corresponds to using instead the generality interpretation of some or all 
x u . . . , x a of those variables. To keep matters as simple as possible, we 
shall suppose here that the variables x t , . . . , x ff having the generality 
interpretation in any of A x , . . . , A,„ have the same interpretation in all 
of A t , ... , A m which contain them free. 71 

The intervening discussion should have made it clear that to get (II) 
from (I), all we need to do is to prefix universal quantifiers Vx x . . . Vx a 
to the assumption formulas A x , . . . , A m (with each whole one of A x , . . . , 
A m as the scope of the prefix Vx x . . . Vx a ) before applying (I), i.e. before 
constructing the truth tables called for under (I). 

In formulating (II) “officially”, we shall use a concise notation for the 
prefixed universal quantifiers. 

We call a f ormula A closed, if. it contains na-vamhleiXree (i.e. it has 
at most bound occurrences of variables); otherwise, open. By the closure 
of A, abbreviated “VA”, we mean the closed formula Vz x . . . Vz„A (i.e. 
Vz x . . . Vz„(A)) where z x , . . . , z„ are the distinct free variables of A, 
taken for definiteness in order of first free occurrence in A. If A is closed 
(i.e. p = 0), VA is simply A. “VA 3 B” means (VA) 3 B, etc. 

Now let V' (— Vx x . . . Vx„) be the operation of closure with respect to 
(just) x x , . . . , x a , which we get from V by omitting from Vz x . . . Vz^ the 
quantifiers other than on x 1? . . . , x,. For example, if x x , . . . , x a is 
Xj, X,, x 3 (<7 = 3), and A contains in order of first free occurrence exactly 
the distinct variables y x , x 3 , y 2 , y 3 , x i, then VA is VyiVxjVyjVysVXjA and 
V'A is Vx 3 Vx,A. Thus V' is Vx x . . . Vx a apart from the order of quantifiers 
and omissions of quantifiers on variables not free in the scope. Where it 

” In IM §§ 22-24, (II) is treated in proof theory without this restriction.” 
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docs not matter for our purposes (as is usually the case), we may not 
bother to distinguish carefully between V' and Vx t . . . Vx a . 

Now we say B is a valid consequence of A 1( . . . , A m (in, or by, the 
predicate calculus) holding constant all variables except Xj, . . . , x„ or 
allowing \ u ... ,x Q to vary or with x 1( . . . , x, general, or in symbols 
Aj, . . . , A„, k x > " x « B, exactly if V'Ai, . . . , V'A m 1= B. Thus 
A 1( . . . , A,„ t= x > x « B holds exactly if, for each domain £>, in truth tables 
entered from a list including all the ions and free variables of 
V'A l5 .... V'A„„ B, the formula B is t in all those lines in which 
V'A 1; .... V'A W are simultaneously t. 72 

This definition and notation need not be restricted to the case each of 
x t , . . . , x„ occurs free in some of A„ . . . , A m ; blit any variable not so 
occurring can be added or removed without affecting the meaning. The 
order of listing the superscripts, and the presence or absence of repetitions, 
is likewise immaterial. 

Example 8. Which of the formulas ~iP(x), “iP(y), Vx - iP(x), -iVxP(x) 
are valid consequences of~iP(x) with x general; or in symbols, which of 
the four statements “-iP(x) k x -iP(x)”, “-iP(x) k x -iP(y)”, 

“ — iP(x) k x Vx-tP(x)”, "-iP(x) k x -]VxP(x)” hold? We must now com- 
pare the tables (a)-(d) with the table for the closure Vx~iP(x) of the 
assumption formula -iP(x) (which closure is the third formula). Thus we 
must now consider those lines in which (c) has t, namely Lines 13, 14, 15, 
16. In each of these lines (a), (b), (c)and (d) have t. This is for D — {1, 2}; 
but we easily convince ourselves of the like for any D. So 
“ — >P(x) k x — iP(x)”, “-iP(x) k x “iP(y)’\ “-tP(x) k x Vx~ iP(x)” and 

iP(x) k x ~ iVxP(x)” all hold. 

Say A contains only x free. The statement “A k B” is stronger than 
"‘A k x B”; i.e. if A k B then A k x B, but not in general conversely. To 
establish this, consider any D, and take tables entered from x and the 
other quantities required to evaluate A and B. For A k x B to hold, it is 
only required that B be t in those lines in which the other quantities cause 
VxA to be t, i.e. cause A to be t for all values of x (in Example 8, where A 
is ~iP(x), Lines 13, 14, 15, 16). For A k B to hold, it is required that B 
be t in all lines, entered from a value of x and the other quantities, in 
which A is t for that value of x. So we have to consider all the lines that 
had to be considered for A k x B and in general others as well (in Example 
7, Lines 7, 8, 9, 10 as well). 

Generalizing this to any number of assumption formulas and variables: 

72 The phrase “except x 1; . . . , x,,” merely removes the variables x,, . . . , x„ from the 
assertion about all variables being held constant. It does not exclude the possibility that 
B is a valid consequence of A with all variables held constant or intermediate possi- 
bilities. 
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If A l( A m k x > • x « B, then A„ . . . , A m h x >- x *’W ~*r B ; but not in general 

conversely. Of course, the converse Joes hold when x a+1 , ... ,x r do not occur 
free in any of A„ . . . , A m . 

The valid consequence relationship (I) (“A 1( . . . , A m b B”) is the one 
which will have the principal role in this book. When we say “B is a valid 
consequence of A,, , A,„” we shall mean that one, though for emphasis 
we may say more fully “B is a valid consequence of A 1( . . . , A,„ holding 
all (free) variables (of A lt .... A m ) constant”. 

Exercises. 20.1. Supply the reasoning for Example 7(a) and (d). 

20.2. Show that: 

(a) Not R 3 P(x) 1= R 3 VxP(x). (b) R 3 P(x) h x R 3 VxP(x). 

(c) Not P(x) 3 R b 3xP(x) 3 R. (d) P(x) 3 R b x 3xP(x) 3 R. 

20.3. Show that: V'A„ . . . , V'A m b B if and only if 
V'Aj, . . . , V'A m b V'B. 

20.4. Note that Exercise 7.6 holds for the predicate calculus. Now use 
Exercise 20.3 to infer that it also holds with “t= x i " x «” in place of “b”. 

20.5. Show that for any variable x and formula A: (a) VxA b A. 

(b) A b x VxA. (c) “A b VxA” does not hold in general. 

20.6. Show that: A t= x B if and only iff VxA 3 B, and also if and only 
if b Vx(VxA 3 B), and also if and only if b VxA 3 VxB; but A b B if 
and only if b A 3 B, and if and only if b Vx(A 3 B). (Cf. Exercise 17.6.) 

20.7. Show that, when C does not contain x free: A(x) b x C if and 
only if b VxA(x) 3 C; but A(x) b C if and only if b 3xA(x) 3 C. 

20.8*. Does Theorem 5 § 4 hold in the predicate calculus with 
“A^ . . . , A m b” in place of “(=”? (For m = 0, cf. end § 19.) 

§ 21. Proof theory: provability and deducibility. To establish the 
proof theory of the predicate calculus, we begin with the axiom 
schemata and rule of inference oi the propositlonaT calculus (i.e. Axiom 
Schemata la-lOb, and modus ponens or the 3-rule; cf. §9). Of course, 
these are now to be used with the present sense of formula (§ 16). 

To these, we add two new axiom schemata, VxA(x) 3 A(r) (the 
V -schema) and A(r)3 3xA(x) (the 1-schema), where r is free for x in 
A(x) etc. (cf. Theorem 15 § 18). That is, the axioms are now to include 
also any formulas having either of these forms. 

We also add two new rules of inference : the 'i-rule allows us to pass from 
C 3 A(x) to C 3 VxA(x), and the 3 -rule from A(x) 3 C to 3xA(x) 3 C, 
when C does not contain x free (cf. Theorem 16). 73 

73 These four simple postulates for the quantifiers are due to Berna ys (according to 
Hilbert and Ackermann 1928 p. 54). But as in §9 (following von Neumann 1927) we 
are using axiom schemata, instead of particular axioms with a postulated substitution 
rule. (There were defects in all the early formulations of the substitution rule for the 
predicate calculus; cf. Church 1956 pp. 289-290.) 
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The definition of a proof B x , . . . , B z (of B,), and of B is provable or in 
symbols I- B, are as before (§ 9), allowing that now we have two more 
axiom schemata and two more rules of inference. 

The definition of a deduction B 1( . . . , B, (of B,) from A l5 . . . , A m is 
likewise as before. 

Moreover, we say that in such a deduction dll (free) variables (of 
A], . . . , A,.,) are held constant, if the following is the case: the V-rule and 
the 3-rule are not applied with respect to a variable (as the x of the rule) 
occurring free in A lt . . . , A,„ except preceding the first occurrence of 
Aj, . . . , A,„ (as such) in the deduction. (We can disregard occurrences of 
A|, . . . , A,„ in the deduction B t , . . . , B t which are justified otherwise 
than as assumption formulas.) 74 

Thus, if we say that in the deduction Bj, . . . , B, from A t , . . . , A, n all 
variables are held constant, we mean the following: Let B t be the first of 
Bj, . . . , B, which is justified as an assumption formula, if any of B 1; . . . ,B, 
is so justified; otherwise, there is no B fc and B*._j shall be B ( . In the part 
Bi, • . • , Bj.^ of the deduction (if it exists), any variable may be the x of 
an inference (with conclusion in that part) by the V-rule or the 3-rule. 
In the part B m , . . . , B, (if it exists), only a variable not occurring free 
in A„ . . . , A m may be the x of an inference (with conclusion in that part) 
by the V-rule or the 3-rule. 

If there is a deduction of B from Aj, . . . , A m holding all variables 
constant, we say that B is deducible from Aj, . . . , A m ( holding all variables 
constant), and write A 1( . . . , A„, I- B. In this book we usually omit the 
words “holding all variables constant”. 

We say that B is deducible from A 1( .... A m holding constant all 
variables except x 1( . . . , x, or allowing x x , . . . , x„ to vary or with . . . , x ? 
general, and write A u . . . , A,„ b* 1 " B, if V'Aj, . . . , V'A„, b B in the 
foregoing sense, where V' (~ . . . Vx 0 ) indicates closure with respect 

to x lt . . . , x,. The significance of this notion should be evident from the 
parallelism of “ b” here to “f ” in § 20. In this book, we shall scarcely use 
“ "more thaTTasTa'shorthand for prefixing VXj . . . Vx a to each of 

the assumption formulas. 75 

71 Thus this definition applies to the deduction B, B, together with an analysis 

of it, i.e. a reason for the inclusion of each formula in it. 34 

73 In IM §§ 22 ff. the derived rules are formulated so as to provide for manipulations 
of statements using “ t” with superscripts allowed. We are foregoing that here, in order 
to simplify the present treatment. 

In IM, “ f” is used in some passages with a “systematic ambiguity” to stand for a 
notion equivalent to what we write here as “ f x i--V\ with various lists of variables 
x,, .... x, (q > 0), and the word “deducible” is used there similarly. Thus rules with 
“ H” in hypothesis and conclusion can be applied there introducing superscripts on 
“ h” in tiie hypothesis, if then the superscripts are carried forward to the conclusion; 
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We can take over automatically most of the results of the proof theory 
of the propositional calculus. For, all proofs and deductions (with 
formulas in the present sense) which we can construct or show to exist in 
the propositional calculus (i.e. using only Axiom Schemata la-lOb and the 
3-rule) are proofs and deductions (with all variables held constant) in 
the predicate calculus, since in the predicate calculus all the same reasons 
are still available to justify each formula in them (and the V-rule and 
3-rule are not used). In particular, if 1- B in the propositional calculus, 
then h B in the predicate calculus. If A x , . . . , A m 1- B in the propositional 
calculus, then A x , . . . , A m f B in the predicate calculus. (In the terminol- 
ogy of § 13, direct rules established for the propositional calculus hold 
ipso facto in the predicate calculus.) 

But from “If A IB, then h A 3 B” in the propositional calculus 
(Theorem 1 1 (a)), we can not immediately infer the like in the predicate 
calculus, but only “If A H B in the propositional calculus, then iADB / 
in the predicate calculus”. (Subsidiary deduction rules do not auto- 
matically extend.) 

Example 9. The following is a deduction of R 3 VxP(x) from 
Vy(R 3 P(y)). 

1. VyP(y) 3 P(x) — V-schema. T.r 

2. VyP(y) 3 VxP(x) - V-rule, l.U 

3. Vy(R 3 P(y)) — assumption formula. 

4. Vy(R 3 P(y)) 3 (R 3 P(y)) - V-schema. 

5. R 3 P(y) - 3-rule, 3, 4. " 

6. R 3 VyP(y) — V-rule, 5. 

7. (VyP(y) 3 VxP(x)) 3 {R 3 (VyP(y) 3 VxP(x))} — Axiom Schema la. 

8. R 3 (VyP(y) 3 VxP(x)) — 3-rule, 2, 7. 

9. {R 3 VyP(y)} 3 {{R 3 (VyP(y) 3 VxP(x))} 3(R3 VxP(x)}} 

— Axiom Schema lb. 

10. {R 3 (VyP(y) 3 VxP(x))} 3{R3 VxP(x)} - 3-rule, 6, 9. 

11. R 3 VxP(x) - 3-rule, 8, 10. 

Since no variables are free in Vy(R 3 P(y)), all variables are held constant 
in this deduction. Thus Vy(R 3 P(y)) h R 3 VxP(x). 


cf. IM p. 103. In l--statements standing by themselves, the superscripts are generally 
shown (when applicable) in IM, as here. 

In IM, instead of using just one list x,, . . . , x, of variables with the generality inter- 
pretation in all the assumption formulas A,, . . . , A„, a different list x n x,„ ( 

can be selected to have the generality interpretation in each respective assumption 
formula A, (/ = 1, . . . , m ). To do this here, we can simply employ VjA,, . . . , V^A„ 
(where V( ~ Vx fl . . . Vx„.) instead of V'Ai, . . . , V' A m . 

The equivalence of the present treatment of variables having the generality inter- 
pretation to that in IM follows from IM Lemma 8a p. 104. 
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If we omii from 111 the two formulas 3 and 4, and justify 5 instead by 
“assumption formula”, wc obtain a deduction of RdVxP(x) from 
R3P(\). But we cannot say all variables are held constant. For, the 
V-rule is used at 6 with respect to the variable y; and the variable y occurs 
free in the assumption formula R D P(y) used at 5 (above 6). Thus we 
are not justified in saying ”R D P(y) F R D VxP(x)”. In fact, this is not 
true, since b\ the theory to follow (§§ 22, 23) it could only be true if 
“R D P( v) r R D VxP(x)” were true; and that is not the case by Exercise 
20.2 (a) (where it is immaterial whether the free variable is x or y). 

Example 10. For any formula A, the following is a deduction of A 
from VxVyA. (By our convention in § 16, x and y are distinct variables.) 

1. VxVyA ■ assumption formula. 

2 . VxVyA D VyA - V-schema, with (the present) x, VyA, x as the 
x, A(x). r of the schema. 

3. VyA — D-rule, I, 2. 

4. VyA DA- V-schema with y. A, y as the x, A(x), r. 

5. A — D-rule, 3, 4. 

There are no applications of the V- or 3-rule, so all variables are held 
constant. Thus VxVyA V A. More generally, for any list of variables 
x, x„, Vx t . . . Vx (/ A F A. 

In the predicate calculus, especial care is necessary to be ?ure that the 
r is free for the x in the A(x) each time we use the V-schema or the 
3-schema (cf. Theorem 15), and that the C does not contain the x free 
each time wc use the V-rule or the 3-rule (cf. Theorem 16). 

Example 10 (concluded). In the two applications of the V-schema, the r 
is free for the x in the A(x), since each time the r is the x itself: the resulting 
occurrences of the r in the A(r) are simply the original free occurrences 
of the x in the A(x). Here the r and x are x in Step 2, and y in Step 4. 

Exampei II. In the left column below, we give a proof of 
Vx3wP(x, w, /) Vv3wP(y, w, z) (so F Vx3wP(x, w, z) Vy3wP(y, w, z)). 
In the right column, we generalize this to a proof of VxA(x) — VyA(y) 
under suitable conditions on (he formula A(x) and the variables x and y 
(to be stated in a moment). The reasons given are the same for both 
columns. The reader should first check their correctness for the left 
column (Exercise 21.1). 

1. Vx3u P(x, w, /) D 3wP(y, w, z) V-schema. 1. VxA(x) D A(y). 

2. Vx3wP(x, w, /) D Vy3wP(y, w, z) V-rulc, I 1 ” 2. VxA(x) D VyA(y). 

3. Vy3wP(\. w. z) D 3wP(x, w, z) - V-schema. 3. VyA(y) D A(x). 

4. V\3wP(y. w. z) D Vx3wP(x, w, 7 ) - V-rule, 3. 4. VyA(y) D VxA(x). 


7. V\3.vP(\, w, /) — Vy3wP(y, w, z) --- using 7. VxA(x)~ VyA(y). 
Axiom Schema 9a with 2 and 4 and the D-rule. 
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For the right column, we shall suppose that x is any variable, A(x) is 
any formula, y‘ is any variable (not necessarily distinct from x) such 
that 

(i) y is free for x in A(x), 

(ii) y docs not occur free in A(x) (unless y is x), 

and A(y) is the result of substituting y for the free occurrences of x in 
A(x). (Recall that (i) means that none of the occurrences of y resulting 
from this substitution is bound.) 

Let us check the requirements that 1-7 be a proof in the right column. 
They are easily verified when y is x. Now take the case y is not x. For 1, y is 
the r, and A(y) is the A(r), for the application of the V-schema; and using 
(i) the requirements of the V-schema are met. For 2, VxA(x) is the C, 
and y is the x, of the V-rule; and by (ii) the x does not occur free in the 
C; so again the requirements are met. For 3, we need to see that 
VyA(y)3A(x) is of the form VyB(y) O B(r), where B(r) comes by 
substituting r for the free occurrences of y in a formula B(y) and r is free 
for y in B(y) (then y and B(y) will be the x and A(x) of the V-schema). 
We got A(y) by substituting y for the free occurrences of x in A(x); and 
by (i) each of the occurrences of y in A(y) resulting from this substitution 
is free, while by (ii) there are no free occurrences of y in A(x); thus A(y) 
has free y’s exactly where A(x) had free x’s, and hence A(x) does come 
by substituting x (as the r) for the free occurrences of y in A(y) (as the 
B(y)). Furthermore none of the occurrences of x which enter A(x) by this 
substitution are bound, as they are simply the original free occurrences of 
x in A(x). So 3 is justified. For 4, we need merely verify that VyA(y) 
contains no free occurrences of x. This is so because A(y) comes by 
substituting y for all the free occurrences of x in A(x). 

In this example, the left column illustrates the right for the case the 
A(x) is 3wP(x, w, z). 

Example 12. Now suppose instead that the A(x) of the right column 
is 3yP(x, y, z) (with its x and y as the x and y). Then (i) would fail, and 
the V-schema would not be applicable at Line 1 . At Line 1 VxA(x) D A(y) 
would be Vx3yP(x, y, z) 3 3yP(y, y, z). In fact, this formula is unprovable. 
For, by its truth table for D = {I, 2}, it is not valid, and hence by the 
theory below (Theorem 12 extended to the predicate calculus in §23), it 
is unprovable. 

Similar examples were given in § 18, where the question whether the 
conditions of Theorem 15 are met is now synonymous with whether the 
V-schema or the 3-schema applies. 

Example 13. Suppose instead that the A(x) is 3wP(x, w, y). Then (ii) 
would fail. Line I is correct (i.e. the V-schema applies). But at Line 2, the 
application of the V-rule with result Vx3wP(x, w, y) D Vy3wP(y, w, y) 
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would be incorrect, since the x and C of the application would be y and 
Vx3wP(x, w, y), with the C containing the x free. Indeed, 

Vx3wP(x, w, y) D Vy3wP(y, w, y) is unprovable. 

Theorem 10 (a) and Corollary in § 9 can be reproved now; but 
Theorems 9 and 10 (b) we postpone to the end of § 22. 

Exercises. 21.1. Check the left column of Example 11. 

21.2. Is the list of formulas a deduction from its first formula? If not, 
find and explain the mistake. 

(a) 1. 3zVxP(y, x, z) — assumption formula. 

2. VxP(y, x, z) D 3wVxP(w, x, z) — 3-schema. 

3. 3zVxP(y, x, z) 13 3wVxP(w, x, z) — 3-rule, 2. 

4. 3wVxP(w, x, z) — D-rule, 1, 3. 

(b) 1. 3xVyP(x, y, z) — assumption formula. 

2. VyP(x, y, z) z>3wVyP(w, y, z) — 3-schema. 

3. 3xVyP(x, y, z) D3wVyP(w, y, z) — 3-rule, 2. 

4. 3wVyP(w, y, z) — D-rule, 1, 3. 

(c) 1. 3yVyP(y, y, x) — assumption formula. 

2. VyP(y, y, x) I33zVyP(z, y, x) — 3-schema. 

3. 3yVyP(y, y, x) D 3zVyP(z, y, x) — 3-rule, 2. 

4. 3zVyP(z, y, x) — D-rule, 1, 3. 

21.3. State conditions on x, A(x), y, A(y), C under which the result of 
Example 9 generalizes to Vy(C D A(y» I- C D VxA(x). 

21.4. Add to the following proof to obtain a deduction of 
3xP(x) D3xQ(x) from Vx(P(x) D Q(x)) (check all requirements). 

( proof given by 
*2 in Theorem 2 
with Theorem 14. 

State the result which this construction establishes, using the symbol “ b”. 
Does the result hold good when P(x), Q(x) are changed to any formulas 
A(x), B(x)? 

21.5. Show that, under the conditions of Theorem 15: 

(a) VxA(x) h A(r). (b) A(r) 1- 3xA(x). 

§ 22. Proof theory: the deduction theorem. We now reprove the 
deduction theorem (Theorem 1 1 § 10) for the predicate calculus. Essential 
use is made of the condition, now incorporated into the meaning of the 
hypothesis “A„ .... A m _!, A m h B”, that in the given deduction 

(a) Bj, . . . , B, 

of B from A„ . . . , A m ~ t , A m all variables are held constant. 

We use the former proof (§ 10) with some modifications and additions. 
(The treatment is illustrated by Example 14 below.) 
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If the last assumption formula A,„ is not used (as such) in the given 
deduction (oc), we shall now construct the resulting deduction simply by 
adding the following at the end of the given deduction: 

/+!'. B, 3 (A,„ 3 B,) — Axiom Schema la. 

1+2'. A„, 3 B, — modus ponens (3-rule), /', l+l'. 

Now suppose the last assumption formula A,„ is used (as such) in the 
given deduction (a); say the first use of it is B,„ i.e. this is the first of 
the formulas B tl . . . , B, which is justified in (a) on the ground of its being 
the last assumption formula A m . We now begin by prefixing “A,„ 3” only 
to the formulas B„, . . . , B, in the given deduction, to obtain 

(|3) Bj B„_ lt A m 3B„,...,A,„D B,. 

Now insertions will be required preceding A,„ 3 B, only for / = 
instead of for i = 1, . . . , /. We again specify the method of making these 
insertions by cases, depending on the reason given for the inclusion of 
B, in (a). 

In Cases 1-3 the former treatment applies unchanged. In Case 4, that 
B, with i > n comes from B,, and B ; ,(£, h < /) by modus ponens, we 
formerly had to provide steps to lead from A,„ 3 B,, and A,„ 3 B A to 
A,„ 3 B„ where B„, B (t , B* are of the respective forms A, A 3 B, B. 
(In § 10, these steps were left to the student in Exercise 10.1, but here they 
are illustrated in Example 14 by 12'— 14' and by 1 7'— 1 9'.) Now, if g < n 
we must first supply the following two steps to get A„, 3 B„: 

B,, 3 (A,„ 3 B, ; ) — Axiom Schema la. 

A„, 3 B„ — modus ponens, g', — . 

Similarly, if/; < //. (This is illustrated by 15', 16' in Example 14.) After 
these preliminaries, the former treatment of Case 4 applies. Tli 

Case 5: B, with / > n conies from a preceding formula B y (g < /) 

by the V-rule. So B,„ B, are of the respective forms C 3 A(x), 

C 3 VxA(x), where C docs not contain x free. If g < /;, we first supply 
two steps to get A m 3 B,,, as in Case 4. Remember that / > n, i.e. the 
application of the V-rule comes after the first use B„ of the assumption 
formula A,„ (as such). Hence x does not occur free in A m ; this is by the 
hypothesis that all variables are held constant in (a), with the definition in 
§21 of a deduction in which ail variables are held constant. By the 
stipulation for the V-rule, C docs not contain x free. Hence A„, & C does 
not contain x free. This fact is used in justifying the new application of 
the V-rule at A- 2 +T below (top p. 114). 


7,1 The changes so far described from the plan in § lOcould have been used there, and 
would often have been made the resulting deductions shorter. 
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A "‘ 3 B »’ i e ' A ”‘ 3 ^ 3 A W) ) deduction given 

, | by Example 7 § 10 

/c.;. A,„ & C 3 A(x) J y F 

k.,+ I'. A,„ & C 3 VxA(x) — V-rule, l<„. | deduction given 

' ‘ ‘ f by Exercise 10.3. 

A-.;. A 3 (C 3 VxA(x)), i.c. A,„ 3 B, J 

Casi. 6: B, with i > n comes from a preceding formula B„ (g < i) by 
the 3-rule. Left to the student (Exercise 22.1). 

We have now seen how to construct a particular deduction of A,„ 3 B 
from Aj, . . . , A m _j. In this “resulting deduction”, all the applications of 
the V-rule and the 3-rule are on the same respective variables, and occur 
in the same order relative to the uses of A 1; . . . , A m _ 1( as the applications 
in the given deduction. Hence, since all variables are held constant in the 
given deduction, all are held constant in the resulting deduction. Therefore 
An ... , A m j h A„, 3 B, as was to be proved. 

Exampll. 14. As the left column illustrates, Vx(P(x) 3 Q(x)), 

VxP(x) h VxQ(x). So by the theorem, Vx(P(x) 3 Q(x)) h VxP(x) 3 VxQ(x). 
fhe right column shows the deduction of VxP(x) 3 VxQ(x) Irom 
Vx(P(x) 3 Q(x)) resulting by our uniform method from the deduction of 
VxQ(x) from Vx(P(x) 3 Q(x)), VxP(x) given in the left column. Because 
the second assumption formula enters only at 4, the prefixing of 
“VxP(x) 3” begins there. 

1. Vx(P(x) 3 Q(x)) - 1st ass’n. f. 1'. Vx(P(x) 3 Q(x)) - same reason. 

2. Vx(P(x) 3 Q(x)) 3 2'. Vx(P(x) 3 Q(x)) 3 

(P(x) 3 Q(x)) - V-schema. (P(x) 3 Q(x)) - same reason. 

3. P(x) 3 Q(x) - m.p., I, 2. 3'. P(x) 3 Q(x) - same reason. 

• • • 1 Example 

4. VxP(x) - 2nd ass’n. f. 8'. VxP(x) 3 VxP(x)j 4 § 9. 

9'. VxP(x) 3 P(x) — V-schema. 

10'. (VxP(x) 3 P(x)} 3 

{VxP(x) 3 (VxP(x) 3 P(x))} 

— Axiom Schema la. 

5. VxP(x) 3 P(x) - V-schema. 1 1'. VxP(x) 3 (VxP(x) 3 P(x)) 

— modus ponens, 9', 10'. 

12'. (VxP(x) 3 VxP(x)) 3 

{{VxP(x) 3 (VxP(x) 3 P(x))} 3 
(VxP(x) 3 P(x)}} - Ax. Sch. lb. 

13'. {VxP(x) 3 (VxP(x) 3 P(x))} 3 
(VxP(x) 3 P(x)} - m.p., 8', 12'. 
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6. P(x) — modus ponens, 4, 5. 14'. VxP(x) 3 P(x) — m.p., 1 1', 13'. 

15'. (P(x) 3 Q(x)} 3 {VxP(x) 3 
(P(x) 3 Q(x))} — Ax. Sch. la. 

16'. VxP(x) 3 (P(x) 3 Q(x)) 

— modus ponens, 3', 15'. 

17'. (VxP(x) 3 P(x)} 3 

{{VxP(x) 3 (P(x) 3 Q(x))} 3 
(VxP(x) 3 Q(x)}} - Ax. Sch. lb. 
18'. {VxP(x) 3 (P(x) 3 Q(x))} 3 

{VxP(x) 3 Q(x)} — m.p., 14', 17'. 

7. Q(x) — modus ponens, 6, 3. 19'. VxP(x) 3 Q(x) — m.p., 16', 18'. 

8. Q(x) 3 ((P 3 P V P) 3 22'. VxP(x) 3 (Q(x) 3 

Q(x)) — Axiom Schema la. ((P 3 P V P) 3 Q(x))} 

9. (P 3 P V P) 3 Q(x) 25'. VxP(x) 3 

— modus ponens, 7, 8. {(P 3 P V P) 3 Q(x)} 

k[. VxP(x) & (P 3 P V P) 

3Q(x) 

*,+ !'. VxP(x) & (P 3 P V P) 

3 VxQ(x) — V-rule, k' r 
• » • 

10. (P3PVP)3 VxQ(x) k' r VxP(x) 3 {(P 3 P V P) 

— V-rule, 9. 3 VxQ(x)} 

• • • 1 like 

11. P3PVP- Ax. Sch. 5a. ^+3'. VxP(x) 3 (P 3 P V P) | 9'-l 1'. 

• • • 1 like 

12. VxQ(x) — m. p., II, 10. k 2 + 6'. VxP(x) 3 VxQ(x) / 12'— 14'. 

Corollary Theorem 11 follows from the theorem as before. 

We now observe that Theorem 9(i) and (ii) hold for the predicate 
calculus with no change in statement. 

However (ii) requires a new method of proof. If we simply combined 
the given deductions as in § 9, it could happen that applications of the 
V-rule or the 3-ruie to variables which occur free in A„ . . . , A m would 
come under the first use of Aj, . . . , A,„ in the resulting deduction. By an 
obvious rearrangement we could mend this with respect to such appli- 
cations coming from the p given deductions of Bj, . . . , B p respectively 
from A 1( . . . , A,„. A more serious difficulty is that such applications, 
with respect to variables occurring free in A lf . . . , A m but not in 
Bn ... , B^ might come from below the first use of the assumption 
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formulas B,, . . . , B„ (as such) in the given deduction of C. This difficulty 
we shall overcome by use of the deduction theorem. 

As the hypotheses of (ii) we have: (1) A„ . . . , A,„ f B, (/ = I, . . . ,/;) 
and (2) B„ . . . , B„ h C. Applying Corollary Theorem ll,. d to (2): 77 
(3) f Bi 3 (. . . (B„ 3 C). . .). Now we build a deduction of C from 
A lt .... A,„ as follows. Take the p deductions of B„ . . . , B ( , respectively 
from A,, . . . , A m (which exist by (1)); write them consecutively; and then 
move to the front of the combination the part of each which precedes the 
first use in it of an assumption formula (as such). To the sequence of 
formulas thus obtained, prefix the proof of B, 3 (. . .(B„ 3 C). . .) 
(which exists by (3)). Finally, suffix applications of modus ponens which 
lead from B t 3 (. . ,(B„ 3 C) . . .), Bj, . . . , B,, to C. 

In the deduction of C from A,, .... A,„ thus constructed, there are no 
applications of the V-rule or the 3-rule coming below uses of the assump- 
tion formulas (as such) except ones which came in from such applications 
in the deductions given by (1). By the hypothesis (implicit in our using 
“ I-" without superscripts) that all variables are held constant in those 
deductions, the variables for these applications do not occur free in 
A u . . . , A,„. Thus in our deduction all variables are held constant, so 
Ai A,„ h C. — 

Having reinstated Theorem 9, we now have all the general properties 
of h (written without superscripts) that were noted in § 13. 

Exercises. 22.1. Treat Case 6 in the proof of the deduction theorem. 

22.2. Is the deduction theorem applicable to 3-12 of Example 14 as the 
“given deduction” (with 3 justified by “assumption formula”)? 

22.3. Prove Theorem 10, >d (b). 

§ 23. Proof theory: co nsistenc y, introduction and elimination 
rules. The corollaries to Theorems 10,, d and 1 l,. d in §§ 21, 22 reduce the 
notion of deducibility “A„ . . . , A, n I- B” to that of provability “ h E” 
in a manner parallel to the reduction by Corollary Theorem 8,, d in § 20 of 
valid consequence “A,, . . . , A m b B” to validity “h E”. 77 (Likewise, 

using V', we have parallel reductions of “A, A„, |- x i x » B” and 

“A,, . . . , A, n b x » - x « B”.) So now to show the equivalence of mod el 
theory and proof theory for the predicate calculus, it remains to show that 
V E if and oniyTf F E ' 

The “if” part is easy. Theorem 12 and Corollary of § 11 hold again. 
The proofs are as before, using now la-lOb in Theorem 2,> d , and Theorems 

” In citing theorems taken over, or slightly adapted, from the propositional calculus, 
to the predicate calculus without being fully restated, we may use the old numbers with 
“ P ii” suffixed (eg. “Theorem 1 1,,,,” here). Wc may also suffix ‘V’ if at the same time the 
“t=” is changed to " h”, etc. (e.g. "Theorem 6,. (l ” end § 19, “Theorem 6 1>d ,.” end § 25, 
“Theorem I2|.,,__” § 29). 
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3,, (1 , 15 and 16, in § 18. The present corollary does not require the full 
force of the theorem, which refers to any D. It can be inferred from the 
theorem with “t=” replaced by “1-N”, i.e. using only D — {1}. This is 
the way the simple consistency of the predicate calculus was first proved 
by Hilbert and Ackermann in 1928. 

The “only if” part, f E only if V E (extending Theorem 14 to the 
predicate calculus), is not as elementary as the theorems we have thus far 
given. This proposition is included in Godel’s completeness theorem 193 0, 
which we shall postpone to Chapter VI (T heorem 37 § 51). 

In classical logic (with which we are primarily concerned), we are 
interested in the valid formulas because they express universal logical 
truths, and in the provable ones because they are valid. 

In the case of the (classical) p ropositional calculus, constructions of 
truth tables (or truth-value analyses, § 8) constitute a sure and unimpeach- 
able method of demonstra ting validity, dire ctly f rom its d efinitiop in 
model theory. However" this method isn’t always very practical, and it 
usually - doesn’t resemble actual thought processes, in which the utmost 
economy is essential. This is partly why we introduced also pr oof th eory ^ 
Of course, we didn’t stop with the postulated form of proof theory, b 
we proceeded thence to derived rules (§ 13). I he postulated form of proof 
theory can be regarded there as a very convenient way station between 
model theory (validity, etc.) and the derived rules of proof the ory. 78 

Now, in the (classical) predicate calculus, direct application of the 
definition of validity is no longer a simple matter. For, demonstrations of 
validity by considering the truth tables are no longer purely mechanical, 
but require general reasoning about the truth tables for all (non-empty) 
domains £>; and for infinite D' s, the truth tables are infinite objects. 

Hence, in the pr edicate calculus, proof theoryjr as the advanta ge over 
model t heory, not only of convenience, but of greater concreteness. We 
are on firmer ground in establishing provability from its definition in 
§ 21, with the help of elementary theorems such as the deduction theorem 



Ts Our form of proof theory of the propositional calculus, so far as it consists in rules 
for establishing logical truths (§ 3 Paragraph 2), could be avoided by developing our 
derived rules entirely as propositions about model theory, beginning with la-lOb 
in Theorem 2, and Theorem 3 (cf. § 13 Paragraph 3). 

Then in the proof theory of the predicate calculus, one could simply take all tautol- 
ogies (valid formulas) of the propositional calculus as axioms en masse (instead of 
postulating our Axiom Schemata la-lOb or something similar). 

There are some good reasons for not thus leaving out our postulated form of proof 
theory in the propositional calculus. Some of the results in the proof theory of the 
predicate calculus and more complicated systems are most easily obtained by beginning 
with a treatment in the propositional calculus. In any case, it is good to practice with 
proof theory in the simpler situation first. In the in tuitionistic propositiona l calculus 
(end § 12), a simple truth-table method is not available. 
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and results based on the proof theory of the propositional calculus, than 
in demonstrating validity from its definition in § 17. By Theorem )2,, u , 
demonstrations of provability do establish validity, for anyone who 
accepts the reasoning in its proof. 70 

Godel’s completeness theorem for the predicate calculus has the 
significance that, for all true statements of validity, this method of 
demonstration suffices. In using this method, we escape from having to 
reason further in terms of the validity notion, after handling the few cases 
of it that we needed for the proof of Theorem 12,> (I . Demonstrations of 
all other true statements of validity can be put together out of repetitions 
of the reasoning used in handling those few cases. 

Because of our preference for the proof-theoretic approach to the predi- 
cate calculus on the ground of its greater concreteness, we now develop 
some of the further results proof-theoretically, including some which in 
Chapter 1 or § 19 we did rnodel-theoretically. 

Thloklm 13 again holds. For, as we remarked early in § 21, the direct 
rules carry over automatically from the propositional to the predicate 
calculus; and the three subsidiary deduction rules in Theorem 13 are 
established now by the same proofs from Theorem II,.,, in §22 as before 
from Theorem 1 1. We now add four more derived rules. 

Theorem 21. Let x be any variable , A(x) be any formula, r be any 
variable not necessarily distinct from x, and A(r) be the result of substituting 
r for the free occurrences of x in A(x). Also let P be any list of ( zero or more) 
formulas, and C be any formula. Then the following rules hold, provided: 

(A) For V -elimination and ^-introduction, r is free for x in A(x). 

(B) For V -introduction and 3-elimination, P does not contain x free. 

(C) For 3-elimination, C does not contain x free. 

Introduction Elimination 

V //FbA(x), VxA(x) b A(r). 

then P h VxA(x). 

3 A(r)b3xA(x). //P, A(x)bC, 

then P, 3xA(x) b C. 

79 A person has the option of taking the postulates as the starting point of logic, 
without attempting to put behind them an argument like the proofs of Theorem 2 tA , 
3,. dl 15, 16. Something very similar to those postulates must go into the intuitive reason- 
ing in those proofs anyway. On the other hand, giving those proofs explicitly does help 
to guard against mistakes, such as were made in formulating the postulated substitution 
rule in the early systems of the predicate calculus. 73 

There is some arbitrariness whether our particular set of postulates or some other is 
accepted. The problem of completeness (solved by Gddel, e.g. for our postulates) only 
appears to a person who is not merely accepting a set of postulates as the starting point 
of logic. 
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Proofs. V-elimination, 3-introduction. Cf. Exercise 21.5. 
V-introduction. Let C be an axiom not containing x free. 

1. F E A(x) — by hypothesis. 

2. T, C 1 A(x) — Exercise 13.2, 1. 

3. F A(x) — Theorem 11 P(1 , 2. 

4. fl-CD VxA(x) — from 3 using the V-rule, or in more detail thus: 
Consider a given deduction B x , . . . , B, of C 3 A(x) from T with all 
variables held constant. To this we can add as B i+1 the formula 
C 3 VxA(x), justified by the V-rule with B, as premise, since C does 
not contain x free. In the resulting deduction from T, all variables are 
held constant, since the new application of the V-rule is on x, which by 
Proviso (B) does not occur free in T. 

5. r, C i VxA(x) — Theorem 10 Pd (or modus ponens), 4. 

6. T E VxA(x) — Exercise 13.2, 5. 

3-elimination. 

1. I -1 , A(x) EC — by hypothesis. 

2. T E A(x) 3 C — Theorem 1 l Pd , 1. 

3. T E 3xA(x) 3 C - from 2, using the 3-rule (and Provisos (B), (C)). 

4. T, 3xA(x) EC — Theorem 10 Pd , 3. 

In Exercise 23.1 below we shall illustrate the importance of watching 
Provisos (B) and (C) in using these rules. The importance of (A) should 
appear from examples in §§ 21 and 18 which deal with essentially the same 
matter. 

The subsidiary deduction rule of V-introduction is unlike our other 
subsidiary deduction rules in having the same assumption formulas in 
the given or subsidiary deduction and the resulting deduction. It is stated 
as a subsidiary deduction rule because of Proviso (B). Except for this 
proviso, Exercise 13.3 would apply and give A(x) EVxA(x) . That 
however does not hold; for then, by Theorem 12 Pd (with Theorems ll Pd 
and 8 Pd ) we would have “A(x) E VxA(x)”; and this is not the case e.g. 
when A(x) is P(x) or (as we saw in Example 7 § 20) ~iP(x). 

Corollary 1. Let w„ . . . , w p be distinct variables, A(w 1; . . . , w„) 
be a formula, r t , . . . , r p be variables not necessarily distinct from each 
other or from w„ . . . , w P , and A(r„ . . . , r v ) be the result of substituting 
r x , . . . , r P simultaneously for the free occurrences of w x , . , , , w P respectively 
in A(w x , .... w p ). If this substitution is free (i.e. if the resulting occurrences 
of r x r p in A(r x r p ) are free), then: 

(a) Vwj . . . Vw P A(w 1( . . . , w p ) E A(r x r p ). (p-fold V-elimination.) 

(b) A(r„ . . . , r p ) E3wj . . . 3w p A(w w P ). (/7-fold 3-introduction.) 

Proofs, for p > 1 . (For p = 0, (a) and (b) are simply Theorem 
9 Pd (i) for m— 1. For p = 1, they are in the theorem.) (a) For illustration. 
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suppose p = 2, and w„ w, are x, y, and A(w,, w 2 ) is 3wP(w, x, y), and 
Tj, r., are y, /. Then the substitution is free. (It would not be if r„ r 2 were 
w, z.) We must show that VxVy3wP(w, x, y) l-3wP(w, y, z). Evidently 
we should use V-eliminations, but a little care is necessary. We cannot 
first eliminate Vx with y as the r to obtain Vy3wP(w, y, y), as the resulting 
occurrence of y (the second) would not be free. To use an easily-described 
uniform method, let x 1( x 2 be two distinct new variables, i.e. variables 
distinct from each other and all the others in the illustration. By two 
successive V-elims. (with Theorem 9 1I(1 (ii)): 

(1) VxVy3wP(w, x, y) f3wP(w, x,, x 2 ). Thence by two successive 
V-introds. : (2) VxVy3wP(w, x, y) L Vx!Vx 2 3wP(w, x lf x 2 ). By two succes- 
sive V-elims.: (3) VxiVx 2 3wP(w, Xj, x 2 ) H 3wP(w, y, z). Combining (2) and 
(3) by Theorem 9 Vd (ii): (4) VxVy3wP(w, x, y) h3wP(w, y, z). 

(b) Left to the student (Exercise 23.2 (b)). 

Corollary 2. (Substitution for individual variables.) 63 Under the 
circumstances of the theorem or Corollary 1, and provided T does not 
contain free occurrences of x or w„ . . . , w„, respectively. 

(c) //r b A(x), then V f A(r). 

(d) If r h A(w t , . . . , w„), then V L A{r 1( . . . , r p ). 

Proof is Exercise 23.3. 

Ext RC lsi.s. 23.1. In each of the following, the formula in the last 
statement is not valid (by Exercises 17.3, 17.5) and therefore (by Theorem 
12,, d ) not provable, contrary to what is asserted. Locate the fallacy. 

I. “1. P(x) h P(x) — (i) (in Theorem 9 Pd ). 

2. P(x) h VxP(x) — V-introd., 1. 

3. 3xP(x) h VxP(x) - 3-elim., 2. 

4. P3xP(x) 3 VxP(x) — 3-introd., 3.” 

II. “1. P(x), Q(x) l-3x(P(x) &Q(x)) — &- and 3-introd. 

2. P(x), 3xQ(x) H3x(P(x) & Q(x)) - 3-elim. , 1. 

3. 3xP(x), 3xQ(x) P3x(P(x) & Q(x)) — 3-elim., 2. 

4. 3xP(x) & 3xQ(x) h 3x(P(x) & Q(x)) — &-elims., 3 (with (ii)). 

5. h 3xP(x) & 3xQ(x) 3 3x(P(x) & Q(x)) - 3-introd., 4.” 

III. "I. P(x) h P(x) - (i). 

2. 3xP(x) h P(x) - 3-elim., I. 

3. h 3xP(x) 3 P(x) — 3-introd., 2.” 

23.2. (a) Write out the six V-elims. and V-introds. in the illustration of 
the proof of Corollary 1 (a), and verify that the provisos are satisfied, 
(b) Give the proof of Corollary 1 (b) with the same illustration. 

23.3. Prove Corollary 2. Also simplify the statement for the case T 
is empty. 
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23.4. Show that: For each variable x and formula A, E A if and only 
if E VxA; and hence, E A if and only if E VA (cf. § 20 for “V”). 

23.5. Show that: V'A l5 . . . , V'A,„ F B if and only if 
V'Aj, .... V'A m E V'B (where V' ~ Vx, . . . Vx, ; as in § 20). 

23.6. True or false (and why)? 

(a) "V/3yVxP(x, y, z) E Vx3yVzP(x, y, z).” 

(b) “3xVyVzP(x, y, z) E Vz3xP(x, x, z).” 

23.7. For each of the following modifications of the predicate calculus, 
say which of the two implications {EE} {E E} (Theorem 12j, d and 
Chapter VI) continue to hold for all formulas E, and why. 

(a) Add as a new axiom schema VxA(x) 3 3xA(x). 

(b) Add as a new axiom schema 3xA(x) 3 VxA(x). 

(c) Omit the V-schema, V-rule, 3-schema and 3-rule. 

§ 24. Proof theory: replacement, chains of equivalences. 
Theorem 22. Let x be any variable , and A, B, C, A(x), B(x) be any 
formulas , and for *69a-*72a let F be any list of (zero or more) formulas pot 
containing x free. M> 

*6. AOBE(BOC)D(A3C). ‘7. A3B E(CDA)D(CDB). 

*8a. A3B E A & C 3 B & C. *8b. A3BEC&A3C&B. 

*9a. AOBEAVC3BVC. *9b. A3BECVA3CVB. 

*12. A 3 B I — iB 3 — iA. *13. ADiB EB DiA. 

*14°. “iA 3B E~iB3 A. *15°. mA 3~iB EB3 A. 

*69a. If l* E A(x) 3 B(x), then F E VxA(x) 3 VxB(x). 

*70a. IfV E A(x) 3 B(x), then F E 3x A(x) 3 3xB(x). 

(Introduction of a logical symbol into an implication, 
including contraposition with double negations suppressed.) 

*25a. A ~ B E(A~C)~(B~C). *25b. A ~ B E (C~ A) ~ (C ~ B). 
*26. A~BEA3C~B3C. *27. A ~ B E C 3 A ~ C 3 B. 
*28a. A ~ B EA & C ~ B & C. *28b. A~B EC&A~C&B. 
*29a. A~B E A V C ~ B V C. *29b. A~BECVA~CVB. 

*30. A~B I — iA ~ ~ iB. 

*71 a. 7/F EA(x)~B(x), then F E VxA(x) ~ VxB(x). 

*72a. If F EA(x)~B(x), then F E 3xA(x) ~ 3xB(x). 

(Introduction of a logical symbol into an equivalence.) 

As before (Theorem 2), we are numbering results in agreement with 1M, as far as 
possible. (*69a-*72a are modifications of IM *69-*72. *25a, *25b, *40a, *40b do not 
occur as numbered results in IM.) 28 
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*40a. 

A f ( A ~ B) ■ — • 


*40b. 

-iA 1- (A ~ B) ~ 



(B ~ A) ~ B. 



(B~A)~mB. 


*41. 

A k A 3 B — B. 


*43. 

-iA h A 3 B ~ ~ iA. 


*42. 

A b B 3 A~ A. 


*44. 

~iA 1- B 3 A ~ ~iB. 


*45. 

A 1 A & B B & A 

B. 

*47. 

— iA LA&B' — - B & A > — ' 

A. 

*46. 

A 1AVB~BVA~ 

A. 

*48. 

iA f A V B ■ — B V A ~ 

B. 


(Special cases of the binary propositional connectives.) 


Proofs. All of the results except the four involving quantifiers 
(*69a-*72a) can be established in the propositional calculus. This can 
be done automatically with k, by truth table computation with P, Q, R, 
followed by substitution in the form of Exercise 7.3; 81 and then k can be 
changed to h by completeness (Theorem 14 with Theorems 8, 10); or we 
can use the rules of Theorem 13, employing *6-* 12 in the proofs of 
*26-*30. Then they can be taken over into the predicate calculus, by 
beginning § 21 . (Exercise 24.1.) 

*69a, *70a. *70a is the result of Exercise 21.4, when we employ a 
deduction of A(x) 3 B(x) from P instead of from Vx(A(x) 3 B(x)). 
*69a is obtainable similarly, or by generalizing Example 14 from P(x), 
Q(x) to A(x), B(x) and replacing the first three steps by the assumed 
deduction of A(x)3B(x). Or we can use the rules of Theorems 13 
and 21 (Exercise 24.2). 

*7 la, *72a. *7 la is established as follows, and *72a similarly. 

1. I' b A(x) -- B(x) 1 A(x) 3 B(x) — hypothesis; ^-elim. 36 

2. T t- VxA(x) 3 VxB(x) - *69a, 1. 

3. 1’ h VxB(x) 3 VxA(x) — similarly. 

4. P h VxA(x) ~ VxB(x) — ~-introd., 2, 3. 

Now we can extend Theorem 5 to the predicate calculus with “P (■" in 
place of “k”, (The extension with “k” is in § 19.) 

Theorem 23. (Replacement theorem.) Let C A be a formula con- 
taining a formula A as a specified ( consecutive ) part, and let C JS come from 
C A hv replacing that part by a formula B. Let Xj, . . . , x a be the free 
variables of A or B which belong to quantifiers of C A having the specified 
part within their scopes (i.e. x lf . . . , x, are the variables having free 
occurrences in A or B which become bound occurrences in C A or C B ). Let 
P be a list of ( zero or more) formulas not containing any o/x„ . . . , \ Q free 
Then: 

If I’ L A B, then P L C A < — C )t . 

Prooi. Wc illustrate the proof by four examples. 

5,1 The tables (1) in § 8 provide shortcuts to *40a-*48. 
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Example 15. Let C A be R 3 Vz( ~i3xP(x, y, z) V Q(z)) with the 
specified part A underlined. Let “T b A ~ B be 

“ b -iBxP(x, y, z) ~ Vx-iP(x, y, z)” (with F empty), which holds by 
*82a in Theorem 26; i.e. the formula 1 below is provable. Using *29a, 
*7 la and *27 we infer successively that 2-4 are also provable. 

1. -i3xP(x, y, z) — Vx-iP(x,y, z) 

2. -i3xP(x, y, z) V Q(z) ~ Vx-iP(x, y, z) V Q(z) 

3. Vz(~ i3xP(x, y, z) V Q(z)) ~ Vz(Vx-iP(x, y, z) V Q(z)) 

4. R 3 Vz(— i3xP(x, y, z) V Q(z)) ~R3 Vz(Vx-iP(x, y, z) V Q(z)) 

Thus b R 3 Vz(-i3xP(x, y, z) V Q(z)) ~ R 3 Vz(Vx-iP(x, y, z) V Q(z)), 

which is what the theorem asserts in this example. 

Example 16. Let C A be P(x) V Vx3y(P(x) 3 Q(y)) with the specified 

part A underlined. Let ‘T b A ~ B” be 

“VxVy[P(x) ~ 3zR(x, y, z)] h P(x) ~ 3zR(x, y, z)”, which holds by 
double V-elim.; i.e. the formula 1 below is deducible from the T. Using 
*26, *72a (since the F does not contain y free), *71a (since x isn’t free in 
T), *29b, we infer successively that 2-5 are also deducible from the F. 


1 P(x) ~ 3zR(x, y, z) 

2. P(x) 3 Q(y) ~ 3zR(x, y, z) 3 Q(y) 

3. 3y(P(x) 3 Q(y)) — 3y(3zR(x, y, z) 3 Q(y)) 

4. Vx3y(P(x)3Q(y))~ Vx3y(3zR(x, y, z) 3 Q(y)) 

5. P(x) V Vx3y(P(x) 3 Q(y)) ~ P(x) V Vx3y(3zR(x, y, z) 3 Q(y)) 

Thus VxVy[P(x)~3zR(x, y, z)] b 

P(x) V Vx3y(P(x) 3 Q(y)) ~ P(x) V Vx3y(3zR(x, y, z) 3 Q(y)), 
which is what the theorem asserts in this example. 

Example 17. In Example 16 change B to 3zR(x, w, z). Since w does 
not become bound in replacing the specified P(x) by 3zR(x, w, z), it is 
permitted to be free in the F. Thus Vx[P(x) ~ 3zR(x, w, z)] b 

P(x) V Vx3y(P(x) 3 Q(y)) ~ P(x) V Vx3y(3zR(x, w, z) 3 Q(y)). 
Example 18. Changing the specified occurrence of P(x) in Example 16 
to be the first one, C A ~ Cji is deducible from P(x) ~ B (using simply 
*29a), whatever the B. 

Corollary 1. For C A , C H , x„ . . . , x„ as in the theorem: 

A ~ B b x -- x « C A ~ C„. 

Proof. Apply the theorem with Vxj . . . Vx,/A~ B) as the I , as in 
Examples 16, 17. Remember that "A ~~ B b x i- x " ’ means 
“Vx, . . . Vx, ; (A ~ B) b”. 

Corollary 2. (Replacement property of equivalence.) Similarly: 
(a) If F b A~B, then F, C A b C„. (b) A ~ B, C A b x ‘ -N C„. 



124 


Till: PREDICATE CALCULUS 


CH. II 


Now wc can return to (a)-(0 in § 5, and verify that they hold with “E” 
replaced by ‘T E” (Exercise 24.3). So with Theorem 23 (or its Corollary 
I ), we have all the ingredients for using chains of equivalences in proof 
theory, with “T E” standing before the chains (instead of “E” as in 
§§ 5, 19). The formulas T must not contain free any of the variables 
x,, . . . , x,, of quantifiers within the scopes of which replacements are 
performed (or such variables which they do contain free must be written 
as superscripts on “E”). In most of our applications (e.g. those in the 
proofs of Theorems 25 and 26), P is empty (as in Example 15); i.e. the 
equivalences are provable, and we need not worry about Xj, . . . , x„. 

Suppose that in C A (as in Theorem 23) the specified part A is not in 
the scope of any ~ (i.e. it is not in the D of any part D ~ E or E ~ D). 
We then say the specified part A is positive or negative according as it is in 
the D of an even or odd number of parts of the form ~iD or the form 
DDE. For example, in d3x((P(x) D Q) V R) D VyP(y) the part P(x) 
is negative (count the underlined operators), and the part Q is positive 
(count the overlined operators). 


Theorem 24. Under the circumstances of Theorem 23, and supposing 
further that the specified part A is not in the scope of any ~ : 


If 1'EADB, then V E 


C.v ^ Cm 


if the-specified-part-A-is 


Corollary. 


A D B E* 1 - x « 



Proofs as Exercise 24.4. 


Lemma 5. Let x he any variable, A(x) be any formula, y be any variable 
(not necessarily distinct from x) such that 

(i) y is free for x in A(x), 

(ii) y does not occur free in A(x) ( unless y is x), 

and A(y) be the result of substituting y for the free occurrences of x in A(x). 
Then: 

*73. E VxA(x) ~ VyA(y). *74. E 3xA(x) ~3yA(y). 

Prools. *73. By the right column in Example 11. *74. Similarly. 

Theorem 25. If A is congruent to B, then E A ~ B. 

Proof. We illustrate this by the example (1) and (2) of congruent 
formulas end § 16. In order to employ a uniform method applicable in 
all cases (though short cuts will generally be available in particular cases), 
we pick a set of new variables x„ x 2 , x 3 to correspond to the three quanti- 
fiers of either. By successive replacements of the variable-occurrences 
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bound by one quantifier at a time, we can transform I (= (1)) into 4 
(cf. (lb), (2b)) and that into 7 (= (2)), thus: 

1. Vx (P(x ) & 3x Q(x , z) 3 3y R(x , y )) V Q(z, x). 

2. Vx (P(x ) & 3x 1 Q(x 1 , z) 3 3y R(x , y )) V Q(z, x). 

3. Vx (P(x ) &3x,Q( Xl , z) 3 3 x 2 R(x , x 2 )) VQ(z, x). 

4. Vx 3 (P(x 3 ) & 3x 1 Q(x 1 , z) 3 3x 2 R(x 3 , x 2 )) V Q(z, x). 

5. Vy (P(y ) & 3x l Q(x 1 , z) 3 3x 2 R(y , x 2 )) V Q(z, x). 

6. Vy (P(y ) & Sx^Xl z) 3 3z R(y , z )) V Q(z, x). 

7. Vy (P(y ) & 3x Q(x , z) 3 3z R(y , z )) V Q(z, x). 

The proof that f A ~ B (here A is (1) and B is (2)) is given by the chain 
method. E.g. for the “third link”, the hypotheses of Lemma 5 are 
satisfied by x, P(x) ASx^fx,, z) 3 3x 2 R(x, x 2 ), x 3 as the x, A(x), y; so 
by *73 the underlined parts of 3 and 4 are equivalent (i.e. the formula 
VxA(x) ~ VyA(y) expressing their equivalence is provable); so by 
Theorem 23, 3 is equivalent to 4. 

Exercises. 24.1. Run through all steps in the proofs of *6, *7 and 
*26 by each of the suggested methods. 

24.2. Establish *69a and *70a by the rules of Theorems 13 and 21, 

24.3. Verify that (a)-(0 in § 5 hold for the predicate calculus with 
“F h”. (For ((3)-(5), cf. •19-*21 in Theorem 2.) 

24.4. Prove Theorem 24 and Corollary. 

24.5. Justify the following. Format (BJ § 13 is used in (b). 

(a) 1. Vx(P(x) 3 M(x)) h P(x) 3 M(x). 

2. Vx(P(x) 3 M(x)) l-3x(S(x) & iM(x)) 33x(S(x) & ~iP(x)). 

3. Vx(P(x) 3 M(x)), 3x(S(x) & -iM(x)) H3x(S(x) & mP(x)). 

This expresses the traditional Aristotelian syllogism “Baroco”: All P 
are M. Some S are not M. Therefore some S are not P. 

(b) Assume Vx(P(x) 3 -iM(x)). Thence successively P(x) 3 -iM(x), 
M(x) 3 -iP(x), 3x(S(x) & M(x)) 3 3x(S(x) & — tP(x)). — Thus 

Vx(P(x) 3 -lM(x)), 3x(S(x) & M(x)) E3x(S(x) & iP(x)) (“Festino”). 82 

§ 25. Proof theory: alterations of quantifiers, prenex form. 
We shall use the proofs for Theorem 26 below to provide further illustra- 
tive and practice material in the technique of § 13 for using our derived 
rules. Formats “(A)”, “(Bj)”, “(B z )”, “(B s )” are used as there. Now we 
have in addition the four introduction and elimination rules for the quanti- 
fiers (Theorem 21). The provisos for these rules must be observed carefully, 
as always. 


82 The translation into our symbolism is explained in § 26 below. Thirteen others of the 
Aristotelian syllogisms can be established similarly, while four fail to hold when similarly 
translated. Cf. Example 23 § 27, Hilbert and Ackermann 1928 Chapter 3 §3. 
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The rule of V-introduction was stated in Theorem 21 as a subsidiary 
deduction rule in which the assumption formulas are not changed from 
subsidiary to resulting deduction. The effect is that, when we are listing 
formulas deducible from assumptions F not containing x free, we can 
proceed by V-introduction from A(x) to VxA(x). 

The rule of 3-elimination serves to eliminate the 3x from 3xA(x) pre- 
paratory to inferring from 3xA(x) (and possibly other assumptions F not 
containing x free) a consequence C not containing x free. The procedure 
corresponds to the following familiar form of reasoning in mathematics: 
“There is an x such that T(.r); now pick such an x. . .” or “There is a 
number with the property A; let x be such a number . . If through a 
chain of reasoning using x a conclusion is reached not containing x, the 
conclusion can be taken to be established without the assumption about x. 

One of the purposes of studying logic formally is to improve our relia- 
bility in the use of logic. This improvement is obtained by our being able, 
in cases of doubt, to trace how our proposed reasoning (or the reasoning 
of others) can be effected by the principles we have studied, or in brief 
how the reasoning can be “formalized”. Also, as will appear in Chapter 
IV, we may sometimes wish to do so to make clear the bases of the rea- 
soning, whether or not we have doubts about its validity. 

Therefore, it is an objective of the formal study of logic to bring the 
methods of recognizing that formulas are provable in the strict sense of 
proof theory (§§ 9, 21) as close as possible to the methods which are used 
informally. 

This we do not claim can be done once and for all, for all kinds of 
reasoning. However, in our opinion, the rules for introducing and elimi- 
nating logical symbols do this very successfully for the ordinary use of 
predicate logic, not just for proving results within logic (as in our next 
illustrations) but also in the application of logic in mathematics and else- 
where. The remarks at the end of § 13 on supplementing these rules 
appropriately by other devices should be kept in mind. 

The introduction and elimination rules are due essentially to Gentzen 
1932. 1934-5, 1936 §5 (who profited from earlier work of Hertz 1929) 
and Jaskowski 1934. 8a These authors were primarily interested in formula- 
tions of logic with such rules as postulated rules, while we have them as 
derived rules; cf. § 13. Our format (B 2 ) corresponds closely to Gentzen’s 
“natural deduction” and Jaskowski 1934. 

It is controversial whether one does better to start with a postulated 
formulation of logic based on modus ponens^ta “Hilber t-type system”, 

" J These rules include the deduction theorem, originating in 1930 and before with 
Herbrand and Tarski. 33 
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§§ 50, 54) and derive the introduction and elimination rules, as we have 
done, or to start out with a version of the latter as the postulated rules 
(a "Gent/en-type system”). 

No one formulation is most convenient for all purposes. One formula- 
tion with postulated axioms and rules is necessary to give a firm starting 
point for proof theory. But it is our view that, no matter what formulation 
one begins with, he will not want simply to use it, but rather he will soon 
wish to escape its limitations by devising short cuts, proving theorems that 
introduce new methods, etc. The history of mathematics is one of con- 
tinually building on previous discoveries, and of devising methods which 
telescope or reorganize steps previously carried out separately into new 
processes of reasoning. 

This being the case, we have preferred to start from the structurally 
simpler Hilbert-type systems in §§9, 21 . These also lend themselves well to 
the addition of mathematical axioms and axiom schemata to establish 
systems of logic and mathematics, of which we shall have more to say in 
Chapter IV. H1 

Theorem 26. Lei x, y be any ( distinct ) variables , A(x), B(x), A(x, y) be 
any formulas, anil A, B be any formulas not containing x free. For *79 aiul 
*80, further let A(x, x) be the result of substituting x for the free occurrences 
of y in A(x, y), and suppose that x is free for y in A(x, y) (i.e. the occurrences 
of x in A(x, x) resulting from this substitution are all free). 8 ’’ 


*75. 

E VxA A. *76. 

E 3xA ~ A. 

*77. 

E VxVyA(x, y)~ *78. 

E3x3yA(x, y) ~ 


VyVxA(x, y). 

3y3xA(x, y). 

*79. 

E VxVyA(x, y) =) *80. 

E3xA(x, x) D 


VxA(x, x). 

3x3yA(x, y). 


*81. E VxA(x) 3 3xA(x). 



*82. E 3xVyA(x, y) 3 Vy3xA(x, y). 


(Alterations of quantifiers.) 

*82a. 

1 — i3xA(x) ~ Vx - tA(x). *82b°. 

1 — iVxA(x) ~3x“iA(x) 

*83'\ 

E 3xA(x) — ■ ~ tVx-tAfx). *84°, 

E VxA(x) ~ -i3\-tA(x), 


(Negation and quantifiers.) 


For purposes which will arise in Chapter VI, we shall then also want postulated 
Genl/en-lype systems, proved equivalent to our present Hilbert-type system. 

' The numbering follows IM. (*82a, *82b arc the *86, *85 ol'IM renumbered to come 
earlier. *99b is not stated in IM.) 
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*87. 

b VxA(x) & VxB(x) ~ 

*88. 

b3xA(x)V3xB(x)~ 


Vx(A(x) & B(x)). 


3x( A(x) V B(x)). 

*89. 

b A & VxB(x) ~ 

*90. 

b A V 3xB(x) ~ 


Vx(A & B(x)). 


3x(A V B(x)). 

*91. 

b A & 3\B( x) — > 

*92". 

bAVVxB(x)~ 


3\( A & B(x)). 


Vx(A V B(x)). 

*93. 

r 3\( A(x) & B(x)) 3 

*94. 

b VxA(x)V VxB(x)3 


3.\A(x) & 3.x B( x). 


Vx(A(x) V B(x)). 


(Conjunction or 

disjunction, 

and quantifiers.) 

*95. 

b A 3 VxB(x) — 


b A 3 3xB(x) ~ 


Vx( A 3 B(x)). 


3x(A 3 B(x)). 

*98". 

b VxA(x) 3 B -- 

*96. 

b3xA(x) 3B~ 


3\( A(\) 3 B). 


Vx(A(x) 3 B). 


*99' . I- VxA(x) 3 3xB(x) ~ 3x( A(x) 3 B(x)). 
*99 b. b (3xA(x) 3 VxB(x)) 3 Vx(A(x) 3 B(x)). 

(Implication and quantifiers.)' 


Proois. 


>*75/ I. VxA f A - V-elim. (with x as both the x and the r and 
/Cas the A(x), so trivially the r is free for the x in the A(x)). 

2. b VxA 3 A - 3-introd., I . 

(A) 3. AHA. 

4. A b VxA — V-introd., 3 (which is legitimate since A does not 
contain x free), 

5. b A 3 VxA — 3-introd., 4. 

6. b VxA ~ A — ~-introd., 2, 5 .t (> 


I. Assume ^xA (preparatory to 3-introd.). By V-elim., 2 A. 
(B t ) II. Assume .,A, which docs not contain x free. So by V-introd., 
s VxA. 


(B s ) 


1. VxA — assumed. 

,, 2. A — V-elim., 1 . 

3. VxA 3 A - 3-introd., 2. 

4. A -- assumed. 

,, 5. VxA - V-introd., 4. 

6. A 3 VxA — 3-in l rod., 5. 

7. VxA A -- ~-introd., 3, 6. 
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1 . VxA I- VxA. 

2. VxA I- A — V-elim., 1. 

3. E VxA 3 A — 3-introd., 2. 

4. A EA. 

5. A E VxA — V-introd., 4. 

6. EA3 VxA — 3-introd., 5. 

7. E VxA ~ A — ~-introd., 3, 6. 

'*76) 1. A E A. 

2. 3xA E A — 3-elini., 1 (noting that A as the C does not contain 
x free). 

3. E 3xA 3 A — 3-introd., 2. r , s 

4. A E3xA — 3-introd. (with x as the r).T it 

5. EA3 3xA — 3-introd., 4. ’t. 

6. E3xA ~ A — introd., 3, 5 .t,, 

\*80)l. A (x, x) E 3x3yA(x, y) — 3-introd. twice (first with y 
as the x and x as the r, using the hypothesis that x is free for y in 
A(x, y); then with x as both the x and the r); or by Corollary 1 (b) 
Theorem 21. 

2. 3xA(x, x) E 3x3yA(x, y) — 3-elim., 1 (noting that 3x3yA(x, y) 
as the C does not contain x free). 

3. E 3xA(x, x) 3 3x3yA(x, y) — 3-introd., 2. 

'^82a) 1. — i3xA(x), A(x) E -j3xA(x). 

2. ~i3xA(x), A(x) E 3xA(x) — 3-introd. (with x as the r). 

3. ~i3xA(x) E -iA(x) — — i-introd., 2, 1. 

4. ~i3xA(x) E Vx~ tA(x) — V-introd., 3 (noting that ~i3xA(x) as 
the P does not contain x free). 

5. E ~i3xA(x) 3 Vx~iA(x) — 3-introd., 4. 

6. Vx - iA(x), A(x) E A(x). 

7. Vx - iA(x), A(x) I — iA(x) — V-elim. 

8. Vx"iA(x), A(x) I — iVx~iA(x) — weak —i-elim., 6, 7. 

9. Vx - iA(x),3xA(x) I — iVx~iA(x) — 3-elim. , 8 (noting that neither 
~iVx _ iA(x) as the C nor Vx - iA(x) as the F contains x free). 

10. Vx-iA(x), 3xA(x) E Vx~ iA(x). 

1 1. Vx-iA(x) E -i3xA(x) — -i-introd., 10, 9. 

12. E Vx-tA(x) 3 — i3xA(x) — 3-introd., 1 1. 

1 3. E ~i3xA(x) <— Vx - iA(x) — -■ — introd., 5, 1 2. 
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I. Assume ,-i3x/\(x). Preparatory to rcductio ad absurdum 
(“i-i n trod.), assume 2 A(x). By 3-introd., ;l 3xA(x), contradicting 
-i3.\A(x). So by the -i-introd. (discharging the assumption A(x)), 
4 ~ iA(x). By V-introd. (as the remaining assumption -i3xA(x) doesn’t 
contain \ free), 5 Vx~iA(x). II. Assume 7 Vx~tA(x). For -i-introd., 
(B,) assume s 3\A(x). Preparatory to 3-elim., assume 0 A(x). From 

Vx-iA(x) by V-elim -iA(x), contradicting A(x). By weak — i-elim., 

u -iV\ _ iA(x). Since neither this formula nor the assumption formulas 
V\— tA(\), 3xA(x) other than A(x) contain x free, we can now com- 
plete the 3-elim., and then (noting the contradiction to Vx-iA(x)) the 
— i-introd., to obtain l3 — i i3xA(x). 

1. — >3xA(x) — assumed. 

2. A(x) — assumed. 

3. 3xA(x) — 3-introd., 2. 

4. ~iA(x) — — i-introd., 3, 1. 

5. Vx~tA(x) — V-introd., 4. 

6. - i3xA(x) D Vx— tA(x) — D-introd., 5. 

7. Vx-tA(x) — assumed. 

8. 3xA(x) — assumed. 

9. A(x) — assumed. 

10. -iA(x) - V-elim., 7. 

1 1. -tVx-iA(x) weak “i-elim., 9, 10. 

12. -tVx-iA(x) -- 3-elim., 1 1. 

13. ~i3xA(x) — -i-introd., 7, 12. 

14. Vx-iA(x) D ~i3xA(x) — D-introd., 13. 

15. -i3xA(x) ~ Vx-iA(x) — ~-introd., 6, 14. 

-:3xA(x) b — i3xA(x). 
x), ~i3xA(x) b A(x). 
x), -i3xA(x) l-3xA(x) — 3-introd., 2. 

-i3xA(x) H “iA(x) — -i-introd., 3, 1. 

-i3xA(x) b Vx— iA(x) — V-introd., 4. 

b — i3xA(x) D Vx~iA(x) — D-introd., 5. 
Vx-iA(x) bVx~ iA(x). 
x), Vx-iA(x) b3xA(x). 

9. A(x), 3xA(x), Vx“iA(x) b A(x). 

10. A(x), 3xA(x), Vx-iA(x) b -iA(x) — V-elim., 7. 

1 1. A(x), 3xA(x), Vx-iA(x) b -iVx-iA(x) — weak -i-elim., 9, 10. 

12. 3xA(x), Vx-iA(x) b — iVx— iA(x) — 3-elim., II. 

13. Vx-iA(x) b -i3xA(x) “i-introd., 7, 12. 

14. b Vx - tA(x) D -i3xA(x) — D-introd., 13. 

15. b — i3xA(x)~ Vx - iA(x) — introd.,6, 14. 


(B 2 ) 


2 , 

3. 

4. 

5. 

6 . 
7. 

( B;j) 8. 


A( 

A( 


3xA( 
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At each step the results in (B 2 ) which are opposite no other arrows are 
available. Thus for 4, we can use I as one of the two “given deductions” 
for the “i-introd., since from -i3xA(x) F - i3xA(x) we have (by general 
properties of F) A(x), - i3xA(x) F ~i3xA(x). Though we could suspend the 
assumption 3xA(x) for Lines 9-1 1, we prefer to keep it in force throughout 
Lines 8 12, so the immediate result of the 3-eIim. applied to 1 1 is 3xA(x), 
3xA(x), Vx — iA(x) I — iVx - iA(x). (Cf. Lines 12-17 in (B :i ) of Example 9 
§13.) 

*82b. We use the chain method (available by § 24). F 3x~iA(x) ~ 
- )3x~ iA(x) [*49] ~ — iVx-i - iA(x) [*82a] ~ -iVxA(x) [*49], 

*87-*94. All of these can be established in pairs, as we illustrate next 
for *91 and *92. (Also *88, *90, *94 can be established by direct methods, 
like *91.) 

*91. f. Assume for D-introd., 4 A &3xB(x). Thence by &-elims., 
2 A and 3 3xB(x). Assume for 3-elim., 4 B(x). By &-introd., 5 A & B(x). 
By 3-introd., „3x(A & B(x)). Now complete the 3-elim. and D-introd. 
(B,) II. Assume „3x(A & B(x)). Assume for 3-elim., 10 A & B(x). By 
&-elims., U A and T 2 B(x). By 3-introd., 13 3xB(x). By &-introd., 
14 A & 3xB(x). Complete the 3-elim. and D-introd. 

*92. F A V VxB(x) ~ — i(~iA & iVxB(x)) [*56] ~ 

— 1( — iA &3x~iB(x)) [*82b] ~ — i3x(~iA & “iB(x)) [*91] ~ 

Vx-i(-iA & ~iB(x)) [*82a] ~ Vx(A V B(x)) [*56]. 

*95-*99b. These can all be obtained by the chain method from *88 
*90, *92, *94 via *59 ( F A D B ~ ~iA V B), *82a, *82b (and *34). The 
transformations can be performed mentally (with a little practice), so this 
can be used to remember them from *88-*94. (Also some of them can be 
established by direct methods.) 


It can be remembered that *87 and *88 hold rather than the similar 
formulas with V and V or with 3 and & (cf. *94, *93), since V and & are 
kindred operations (V can be thought of as a conjunction extended over D; 
e.g. if D = {1, 2, 3}, then VxP(x) is synonymous with P(l) & P(2) &P(3) 
in the extension of the predicate calculus having the symbols “1”, “2”, 
“3” as names for the members of D ), and so are 3 and V. 

Theorem 6° and Corollary 0 now extend to the predicate calculus 
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with h (instead of k), where the operation f now includes the interchange 
of V with 3. The proof is as before, now using also *82a and *82b. 8B 

A prene.x formula is one with all its quantifiers (if any) at the front, 
with all the rest of the formula as scope of the prefixed quantifiers. Thus 
Vx3 v ( [*( x ) V Q(y)) is prenex but not Vx(3yP(x) V Q(y)). If E is equivalent 
to 1- (i.e. I- E~ F) and F is prenex, F is a prene.x form of H. 

Theorem 27". Each formula E has a prenex form. 

Proof. Say for example E is - t3xP(x) V VxQ(x). We can transform 
this E to a prenex formula F by a chain of equivalences, thus: 

h — i3xP(x) V VxQ(x) 

~ Vx-iP(x) V VxQ(x) [*82a] 

~ Vx( — iP(x) V VxQ(x)) [*92 with *34] 

Vx(~ iP(x) V VyQ(y)) [Theorem 25, or *73] 

VxVy(~iP(x) V Q(y)) [*92]. 

In the first application of *92 (with *34), VxQ(x) (which does not contain 
x free) is the A and ~i iP(x) is the B(x). Then we use Theorem 25 to replace 
VxQ(x) by VyQ(y), so that in the final application of *92 (with y as the x) 
the A will not contain the x free. 

The intuitionistic predi cate calculus. (Cf. end § 12.) Simply by 
using Axiom Schema^S^nf^place of Axiom Schema 8 (i.e. basing the 
predicate calculus on the intuitionistic propositional calculus instead of 
on the classical propositional calculus), we obtain the intuitionistic predicate 

86 All that is lacking to extend Theorem 7 (duality) to the predicate calculus with 
E, where ' now includes the interchange of V with 3, is an easy special case of the sub- 
stitution rule (Theorems 1, 17) in the predicate calculus with E (to give f Et* from 

h E + ). 

The idea of the proof-theoretic treatment of substitution for atoms in the proposi- 
tional calculus and for ions in the predicate calculus (and of some other subsidiary- 
deduction substitution rules) is to perform throughout an entire proof the substitution 
desired in the proved formula, and then to argue that the result is still a proof. This is 
straightforward in the propositional calculus (except that, if the substitution involves a 
change in the underlying language, it must extend to the atoms if any in the proof that 
are not in the proved formula); cf. IM Theorem 3 pp. 109 ff. In the predicate calculus, 
there are the risks that the substitution into a proof might introduce a free x into the C 
of an inference by the V-rule or the 3-rule (invalidating the inference), or introduce 
quantifiers into the A(x) of an axiom by the V-schema or the 3-schema so that the r 
would no longer be free for the x (invalidating the axiom); cf. IM pp. 159 ff. (on p. 159 
line 3 from below, add "or no free variables"). Neither of these two complications can 
arise in the very simple cases of substitution for ions that are needed in this book (as 
here, the substitution of negations of the ions for the ions). Substitutions into deduci- 
bility relationships can be treated similarly, or via the deduction theorem etc. as in 
Exercise 7.3. 
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calculus (proof-theoretically). Results established using only the derived 
rules of Theorems 13 and 21, except double negation elimination, hold 
good for it. 87 

Exiatcishs. 25.1. Translate the following from the format (Bj) into 
(B 3 ) and (B 3 ), and check that all steps in (B 3 ) are correct. 

(a) The proof of *91 given above. 

(b) The following proof of *96: I. Assume (preparatory to 3-introd.) 
3xA(x) 3 B. Assume (for 3-introd.) A(x). By 3-introd. 3xA(x), and by 
3-elim. B. By 3-introd. (discharging the assumption A(x)), A(x) 3 B. By 
V-introd., Vx(A(x) 3 B). II. Assume Vx(A(x) 3 B) and 3xA(x). By V-elim. 
from the former, A(x) 3 B. Assume preparatory to 3-eIim. from the latter, 
A(x). By 3-elim., B. 

25.2. Give proofs of *82 and *90 by direct methods in the format (Bj), 
and rewrite them in the formats (B.,) and (B 3 ). 

25.3. Attempt similar treatment, and locate the fallacy. (Reduce the step 
in question to a proposed explicit application of one of the introduction 
and elimination rules, and show what requirement would be violated.) 

(a) The converse of *82 (Vy3xA(x, y) 3 3xVyA(x, y)). (Cf. Example 3.) 

(b) *92. 88 

25.4. Prove *83, *95, *98 from earlier results by the chain method. 

25.5. Prove *99b from *94, and also give a direct proof in the format 

(B,). 

25.6. Give proofs in the format (B x ) of: 

(a) b 3x~ lA(x) 3 — iVxA(x) (cf. *82b). 

(b) f VxA(x) & 3xB(x) 3 3x(A(x) & B(x)). 

25.7°. Establish V Vx(A(x) V B(x)> 3 3xA(x) V VxB(x). 

25.8*. Show that *80 does not hold in general without the restriction 
stated in the second sentence of the theorem. 

25.9. Show that *9! does not hold in general without the restriction 
that A not contain x free. 

25.10. Do Exercise 19.4 with “A is equivalent to B” now meaning 
f A~B. 

25.11. Reduce to prenex form: 

(a) (~i3xP(x) V VxQ(x)) & (R 3 VxS(x)). 

(b) — 1 {( — i3xP(x) V VxQ(x)) & (R 3 VxS(x))}. Save steps by applying 
Theorem 6,, a) . to the result of (a). 

87 Fuller lists of results like those in Theorems 2 (but with “ f”), 22 and 26 that hold 
intuitionistically (written without “°”) are given in IM in Theorems 5,7, 17 and Corollary 
on pp. 114, 119, 163, 165-166. 

88 P V VxQ(x) ~ Vx(P V Q(x)) is interesting as an example of a formula not con- 
taining “i, but which (by IM § 80), cannot be proved without using Axiom Schema 
8 . 
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(c) (-iVxP(x) V3xQ(x)) &(R 3 3xS(x)). Come out with only two quanti- 
fiers. 

(d) VxP(x) ~ 3xQ(x). Use *63a first. 

25.12°*. (a) Prove the theorem: 86 For each formula E and O-place ion P: 
1- E ~~ (P & F,) V (— iP & F 2 ) where F t is P or -iP or a formula not containing 
P (/ = 1, 2). Hint: Use *49, *40a-*48 (in § 24), *75, *76, to obtain F 1; F 2 
as described such that P 1- E ~ F t and ~ iP f E ~ F 2 . 

(b) The theorem comprises 9 ( = 3-3) cases, since each of F x and F 2 can 
have one of three forms. Simplify (P & Fj) V ( — iP & F 2 ) in 8 of these cases. 

(c) State and establish a similar theorem with two 0-place ions P, and P;> in 
place of one P. 

*§ 26. Applications to ordinary language: sets, Aristotelian 
categorical forms. We continue from §§ 14, 15. Now a richer system 
is available for our logical analyses: the predicate calculus, or more specif- 
ically the restricted, one-sorted, classical predicate calculus, §§ 16, 17. 

In translating verbal expressions into logical symbolism, we must as 
before (§ 14) set clearly before ourselves the interpretation, or possible 
models, of the symbolism. 

So, when a translation using only the propositional calculus will not 
suffice, we must be able to regard all the sentences for the given application 
of the predicate calculus as speaking about the members of some one non- 
empty set D. After translating, we will test the translated argument for 
validity without assuming that we know what nonempty set D is. There- 
fore, we are not compelled before translating to be specific about what D 
Jis, so long as we agree that there is some nonempty set D to which all the 
/objects under discussion belong, and which can serve as the range of the 
p variables in the translations of statements using “all”, “some”, etc. 

1 Having envisaged a D, we must next be able to select a list of predicates, 
i.e. of propositional functions (each of some number/? 7> 0 of variables), 
as the independent basic predicates which the sentences concern; no one 
of these predicates is to be regarded (for the purpose of our analysis) as 
composed out of other predicates. Each of these predicates, when each of 
its variables becomes (or takes as value) a member of D, is to become (or 
take as value) a proposition, which is either true or false (but not both). 
We then establish the symbolism for our application of the predicate 
calculus by picking prime predicate expressions or ions, which we usually 
write from the beginning with attached name form variables, to express 
these predicates. 

In these two steps, of picking a D and of picking predicates over D 
which take values that are each either true or false, we may be representing 
what we believe to be the actual situation, or we may be making simplifying 
assumptions to enable us to reason exactly. 
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Let us digress for a moment to introduce a little standard terminology 
about sets. We do not undertake now to criticize the notion of a “set” or 
“collection” or “aggregate” or “totality” or “class” C, as constituted by 
objects thought of together and said to “belong” to C or to be “members” 
of C or “elements” of C. We assume the reader has a sufficient under- 
standing of this for present purposes. We write “a: e C” to say that a; is a 
member of C; and “a; ^ C” to say that x is not a member of C (i.e. “a; C” 
abbreviates - 1 x e C). 

A set is to be completely determined by what objects are members of 
it. Thus Cj and C 2 are the same set exactly if the same objects belong to 
each ; in symbols Cj = C 2 if and only if Vx[ x e C, ~ x e C 2 ], 

A set C is included in a set D or C is a part of D or C is a subset of D, 
or in symbols C £ D, exactly if each member of C is a member of D. In 
symbols, C £ D if and only if Vz[£ e CD x e D]. (Then C — D if and 
only if C £ D & D s C.) For any set D, this definition makes D itself a 
subset of Z); we call D the improper subset of D. Any other subset C of Z>, 
i.e. any set C such that C £ D & C ^ D (i.e. every member of C is a 
member of D, but some member of D is not a member of C), we call a 
proper subset of D, and write C ■= Z). The set 0 containing no objects as 
members, which we call the empty set, is a subset of every set Z>. 89 

For example, suppose D is the set of three elements a, b, c; in symbols, 
D = {a, b, c}. Then D has eight subsets : 


{ }, {a}, {b}, {c}, {a, b}, {a, c}, {b, c), {a, b, c}. 

The first { } is the empty set; the last {a, b, c} the improper subset of D; 
and the second, third and fourth {a}, {b}, {c} are unit sets, i.e. sets with 
just one element each. 

The domain D for an application of the predicate calculus may also be 
called the universal set or the universe or universe of discourse. It is to be 
simply a set, fixed for the given application, which contains as members all 
the objects which we are considering in that application. Thus, if a given 
piece of reasoning concerns natural numbers 0, 1 , 2, . . . and no other 
objects, then D can be the set of all the natural numbers, but need not 
include shoes, ships, sealing wax, cabbages and kings. In the following 
example, which we didn’t successfully analyze in § 14, it wouldn’t do to 
take D to be just the natural numbers. 

Example 19. “All men are mortal. Socrates is a man. Therefore 
Socrates is mortal.” If D, including Socrates, whom clearly we are talking 
about, were just all men, the second sentence would be unnecessary to the 
argument. If the set D of all the objects we are talking about, including 

8 ’ Some books, including IM, use “<=” as we are using “c”. Our “0” is written 
O in IM. 
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Socrates, were by definition exactly the mortal objects, the argument itself 
would be redundant. So we had better think of D as consisting of, say, the 
“beings”, which includes men, mortal objects and maybe more (as in 
Greek mythology, gods). D could include natural numbers and minerals, 
if "x is mortal” is construed as meaningful when y is a natural number 
or a mineral ; but this seems far fetched. Using H(x) to express "x is a man” 
(human being), M(x) to express ‘'x is mortal”, and s to express “Socrates”, 
the argument can be symbolized thus: 90 

(1) Vx[H(x) 3 M(x)], H(s) M(s). 

To say that this is correct reasoning would mean that 

(2) Vx[H(x) 3 M(x)], H(s) 1= M(s) 

(cf. § 20 ). We have chosen (2) rather than 

Vx[H(x) 3 M(x)], H(s) k* M(s), 

because in (I) as a rendering of the verbal argument the variable s is 
supposed in both H(s) and M(s) to name the same member of D , i.e. 
Socrates. The reader will have no trouble in establishing (2) directly, or 
(by beginning § 23) via 

(3) Vx[H(x) 3 M(x)j, H(s) I- M(s). 

Since thus far we have been doing without “constants” as symbols 
separate from variables, it was important here to give s in (1) and thence 
in (2) the conditional interpretation ((I) in §20). We could avoid any 
uncertainty about the status of free variables in premises by writing the 
premises in the form V'A^ . . . , V'A„, where V' is closure with respect to 
all variables not intended to be held constant. This probably corresponds 
best to usage in ordinary language ( outside of mathematics), where con- 
structions equivalent to the use of free variables not held constant scarcely 
occur. However, in mathematics premises are very often stated with free 
variables having the generality interpretation ((11) of § 20), e.g. x+y = y+x 
and x<y & //<z 3 x<z. Hence we see no reason not to allow this practice 
here. We can avoid ambiguity then by writing the variables x,,...,x, 

""Our convention is to use Roman letters (capitals and small letters) to name linguistic 
objects (like formulas and variables), while using italic letters to name mathematical 
or empirical entities (like predicates, sets, and members of the domain D). 

It seems preferable to us here to write “H(x)” as a name for “ x is a man” (which 
makes “x” simply a name for "x") than to mix the Roman and italic letters by writing 
‘ But we might also want to use “ H(x )” as a shorter notation for the predicate 

"x is a man”, whereupon "H(x)“ can be a name for either of the linguistic expressions 
“//(. <•)” and "x is a man” we choose. 
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which are not necessarily held constant as superscripts on just as 
we did on “f=” beginning in § 20, and on “h” in § 21. 

The requirement that all the objects considered belong to one nonempty 
set D, called the domain, does not prevent our talking about the members 
of other sets C x , C„, C 3 , . . . , provided those other sets are included in D 
(i.e. provided c D, Qs A C 3 £ D, . . .). Thus we can regard 
Example 19 as dealing with three sets D = {the beings}, H = {the 
humans} = {the x’s in D such that H(x)} = xH(x), M = {the mortals} = 
{the x's in D such that M(x)} — £M(x). 90 As this illustrates, to each one- 
place predicate C(x), where the variable x ranges over D, we can define a 
subset C of D as containing just those x's for which C(x) is a true proposi- 
tion; in symbols, C = a-C(:b). 91 

Inversely, if we have first introduced some subset C of D, we can define 
a predicate C(x) by C(x) ~ x e C. In particular, if the sets // of humans 
and M of mortals (as subsets of D) were first introduced, then “H(x)” and 
“M(x)” could be considered as standing fora; e H and x e M respectively. 02 

Notice that in (!)— (3) the subsets H and M of D are mentioned via the 
ions H( — ) and M(— ). But D itself is not mentioned explicitly; D enters 
implicitly, as the range of the variable x, and also of s in the validity treat- 
ment, where we disregard its interpretation as naming Socrates. 

Example 20. “Every odd natural number is a difference of two 
squares. 5 is an odd natural number. Therefore 5 is a difference of two 
squares”. For this argument, no objects need to be considered which are 
not natural numbers. So let us take the domain D — {the natural numbers}. 
Using O(x) for “ x is odd”, and D(x) for "x is a difference of two squares”, 
and f to express 5, we obtain the symbolization: 

(1) Vx[0(x) D D(x)}, O(f) .-. D(0- 

Apart from the letters used, this is the same as (1) in Example 19; so the 

•' Other notations for iC(x) are “{.c:C(x)}” and “{z | C(z)}”. 

” 2 We have been considering two predicates CiOr) and C 2 (x) to be the same if and only 
if, for each x, C, (x) and C a (x) are the same propositions, i.e. have the same meaning. 
So it could happen that C x (x) and C'./x) are different predicates, but, for each x in D, 
C\(x) is true if and only if C t (x) is true, which would make xC,(x) and xC 2 (x) the same 
set. 

If we are to maintain this point of view while defining a predicate C(x) by 
C(x) = xeC, we must think of the set C as given together with some particular way of 
describing it which determines the meaning of z 6 C. 

This distinction makes no difference in classical logic, where in considering whether a 
logical relationship holds the predicates are in effect subjected to the X-ray of Footnote 
67 § 17, leaving only logical functions. 

The set C = xC(x) can be called the extension of the predicate C(x). A predicate as 
we have been considering it is an intensional concept, since the meaning or "intension” 
matters, while a logical function is extensional. 
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argument is valid. (The demonstration of validity there encompasses both 
the argument about Socrates and the argument about five.) The D of this 
application is quite precise (or so mathematicians usually think), while 
that of Example 19 is a rather vaguely defined totality. 

This example is similar to the second illustration [2] for material inter- 
pretation early in § 2. We weren’t ready there to talk explicitly about a 
domain D (say the positive integers > 1) or to use a quantifier Vx. So 
there we let n be a number which was fixed by being written on a paper 
in your pocket, but was unknown to me. So my assertion could be con- 
sidered as simply O 3 F, where O is “« is odd” and F is “®“ + y n is 
factorable” for that fixed n. But since I was in ignorance of the particular 
number n in your pocket, I had to consider all the possibilities. Using our 
present notation, 1 can now say that, before risking a wager with you, I 
convinced myself of Vx[0(x) ID F(x)], where D = {all positive integers > 
1}, O(x) is “x is odd” and F(x) is “ x n + y n is factorable”. Thence 1 passed 
by V-elimination to O(x) 3 F(x) (or briefly O 3 F) with x referring now 
to the number n in your pocket. In this illustration, material implication 
3, used in combination with a universal quantifier, enables us to make the 
statement Vx(0(x) 3 F(x)) about just the odd positive integers > 1, 
while using a variable x that ranges over the set of all positive integers 
> 1. Wc are able to do this because we made the truth table for A 3 B 
give t whenever A is f. The relationship between propositional functions 
0(x ) and F(x ) (rather than just propositions) expressed by Vx[0(x) 3 F(x)] 
is called “formal implication”. Similarly, Vx[P(x) ~ Q(x)] expresses 
“formal equivalence” between propositional functions P(x) and Q{x). — 
In the logic of Aristotle (384-322 b.c.) and his followers, down into 
the nineteenth century, a paramount role was given to four forms of 
statement called “categorical”. We show these forms below at the right, 
and our symbolizations of them at the left. Here S(x) stands for “ x pos- 
sesses the property S” and P(x) for “x possesses the property P”. The first 
premises in Examples 19 and 20 illustrate the A form. 93 In these forms, 
when S is expressed in the singular, “are” becomes “is”. “Every” or “each” 
can replace “all”. 

A Vx[S(x) 3 P(x)]. All S are P. 

•(Only P are S.) 

E ~i3x[S(x) & P(x)] No S are P. 

(equivalently by *82a, *55b, *59: (All S are not-P.) 

Vx[S(x) 3 — iP(x)]). 

I 3x[S(x) & P(x)]. Some S are P. 


O 3x[S(x) & iP(x)]. 


Some S are not P. 
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We have stipulated that the range of our variables, here x, be some non- 
empty set D. But, in the symbolizations of the first two forms, the effect 
of “S(x) 3” is to remove from our attention the part of D outside the 
set S, so that the assertion that P(x) holds (first form) or fails to hold 
(second form) is made only for x e S. 

If for the moment we allow a second sort of variables, Vx[S(x) 3 P(x)] 
where x ranges over D is synonymous with V£P(£) where <; ranges over S 
(with S £ £>); and Vx[S(x) 3 -iP(x)] is synonymous with V?-iP(?). 
Similarly, in the third form 3x[S(x) & P(x)] is synonymous with 32P(i;); 
and in the fourth 3x[S(x) & — iP(x)] is synonymous with 3£-iP(l;). 

It would not be worth while to complicate the symbolism by introducing 
a second sort of variables, unless we were to make a great deal of use of 
them. (For simplicity in this book, we are avoiding them in any case.) So 
we now revert to the one-sorted predicate calculus, in which we can get 
the effect of a variable over a subset S = xS(x) of the domain D by pre- 
fixing S(x) 3 to the scope of a universal quantifier and S(x) & to the 
scope of an existential quantifier. 

We excluded the empty set as a possible choice of D. This only means 
that the (complete) range of our variables is not to be empty; there is to 
be at least one value of x. But when we confine our attention to a subset 
S = xS(x ) of D by using Vx[S(x) 3 . . .] or 3x[S(x) &...], this subset S 
may be empty. I may develop the theory of angels in our form of the 
predicate calculus; but if 1 am in doubt whether angels exist, I shall first 
have to embed the set S of angels in some totality D that I am sure is not 
empty. 

Under our translations, when S is empty, “All S are P” and “All S are 
not-P” are true, while “Some S are P” and “Some S are not P” are false. 
For example, on the supposition that there are no angels, we can truth- 
fully assert “All angels have wings” and also “All angels are wingless” or 
equivalently “No angels have wings”. 

This is a point on which ordinary language is ambiguous, just as in 
everyday speaking “or” is sometimes intended inclusively and sometimes 
exclusively. People wouldn’t ordinarily say “All S are P” if they already 
knew that 5 is empty; but they might say it when they were in ignorance 
that S' is empty or had not considered that possibility. Then if confronted 
with the fact or possibility that S is empty, they might disagree on whether 
to regard “All S are P” as true in that case (as we do) or as false. Indeed, 
Aristotle took the latter interpretation; i.e. he took “All S are P” to 

” 3 The letters “A”, “E”, “I”, “O” come from the italicized vowels in the Latin 
“affirmo” (affirm) and "nego” (negate). The A and I forms are “universal affirmative” 
and “particular affirmative”; and the E and O forms are “universal negative” and "par- 
ticular negative”. The letters “S” and “P” correspond to the two parts of the sentence 
considered respectively as “subject” and “predicate”. 
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include the assertion that there are some S. Under Aristotle’s interpreta- 
tion, the translation of “All S are P” into our symbolism is 
3xS(x) & Vx[S(x) 3 P(x)]. UJ When S is nonempty, i.e. when 3xS(x) is true, 
this is equivalent to our Vx[S(x) 3 P(x)]. (By *45 in § 24, 

3xS(x) h 3xS(x) & Vx[S(x) 3 P(x)] — ■ Vx[S(x) 3 P(x)].) 

Our reason for preferring to use “All S are P” and “All S are not-P” in 
the respective senses Vx[S(x) 3 P(x)] and Vx[S(x) 3 ~iP(x)] is that these 
senses are simpler and thus more useful. (In translating someone else’s 
English, we must be ready to prefix 3xS(x)& if that is what he meant.) 
This simplicity is emphasized by writing them in the notation of set theory. 
Indeed, Vx[S(x) 3 P(x)] can be expressed as “5 £ P”; and 
Vx[S(x) 3 — iP(x)] as “5 £ P" where P — x~\P(x). We call P the com- 
plement of P ( with respect to our universal set D). 

Let us introduce the further set-theoretic notations 
S n P = x(S(x) & P( x)) (the intersection of S and P), and 
S U P = .f (S(x) V P(x)) (the union of S and P). 05 Recalling that 
C = {the empty set} = .t[S(a;) & ~iS(x)], we can ^express 3x[S(x) & P(x)] 
as “5 n P ^ 0 ” and 3x[S(x) & -iP(x)j as “S n P ^ 0 ”. 

Aristotle and we and the man on the street usually agree in rendering 
“Some S is P” by 3x[S(x) & P(x)], and “Some S is not P” by 
3x[S(x) & — iP(x)]. If there are no angels, “Some angels have wings” and 
“Some angels are wingless” are both false in everyone’s usage. 

However, in everyday language, “some” (especially if accented) is some- 
times used to mean “some but not all”. When a politician says “ Some 
politicians are crooks”, he means “Not every politician is a crook, although 
some are”, i.e. ~iVx[P(x) 3 C(x)] & 3x[P(x) & C(x)], where the first con- 
junctand is the part he is primarily interested in communicating. 

In everyday language, “all” or “some” may be omitted. “Men are 
mortal” means “All men are mortal”. But “Men have climbed Mt. 
Everest” means “Some men have climbed Mt. Everest”. 

*§27. Applications to ordinary language: more on translating 
words into symbols. As has been amply illustrated in §§ 16 fif., the 
universal and existential quantifiers can be combined with each other and 
with propositional connectives in all sorts of ways; and modern logic has 

M If a person on the street has said, “All S are P”, and we then raise a doubt about 
whether 3\S(x), he could respond in two ways: (1) “What I said was contingent on there 
being some S: i.e. 3xS(x) 3 was intended to be prefixed to my statement.” (2) “I was 
meaning to claim that there are some S; i.e. 3xS(x) & was intended to be prefixed.” 
These lead to the respective translations (I') 3xS(x) 3 Vx[S(x) 3 P(x)] and 
(2 ) 3\S(x) & Vx[S(x) 3 l>(x)J. But (I') is equivalent to our translation 
Vx[S(x) 3 P(x)]. 

^ Sometimes 5 n P is called the “meet”, and S u P the' “join”, of S and P. 
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gone far beyond the analysis of statements of just the four categorical forms 
A, E, f, O of Aristotle. 

As in § 14, we give a list of expressions at the right usually translatable 
as shown at the left. In the translation of expressions using pronouns like 
“everybody”, “somebody”, etc., a variable is to be introduced. This is 
illustrated by our examples (a)-(g) at the beginning of § 16. 

VxA(x) For all x, A(x). For every x, A(x). For each x, A(x). 

For arbitrary x, A(x). Whatever x is, A(x). A(x) always holds. 
Everyone is A. Everybody is A. Everything is A. 

Each one is A. Each person is A. Each thing is A. 

3xA(x) For some x, A(x). For suitable x, A(x). 

There exists an x such that A(x). 

There is an x such that A(x). 

There is some x such that A(x). 

Someone is A. Somebody is A. Something is A. 

For at least one x, A(x). At least one is A. 96 

When negations and quantifiers appear together, care is necessary to 
get them in the intended order. A “not” before or after the quantifier 
seems unambiguous in examples like the following. (The list could be pro- 
longed by using others of the verbal expressions for the quantifiers.) 

-iVxA(x) Not for all x, A(x). 

A(x) does not hold for all x. A(x) does not always hold. 
Not everyone is A. 

Vx-iA(x) For all x, not A(x). 

A(x) always fails. 

Everyone is not A. 

•~i3xA(x) There does not exist an x such that A(x). 

There does not exist any x such that A(x). 

There exists no x such that A(x). 

There is no x such that A(x). 

There isn’t any x such that A(x). 

There isn’t anyone who is A. 

3x~iA(x) For some x, not A(x). 

Someone is not A. 

»« To translate “At most one is A”, “Exactly one is A”, “At least two are A”, etc., 
we must wait till Chapter III. 
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Now consider “All S are not P”. The word order here, which contrasts 
with “Not all S are P”, would seem to call unambiguously for the transla- 
tion Vx[S(x) 3 — iP(x)]. However in examples like (a) “All that glisters is 
not gold” (Shakespeare, Merchant of Venice (1596-7), II, 7) and (b) "All 
women are not gold diggers”," 7 the intended meaning is evidently 
-~iVx[S(x) 3 P(x)] or equivalently 3x[S(x)& ->P(x)]. B8 It would seem that 
in these examples the clear logical structure of the language has been 
corrupted. In (a) Shakespeare, and Chaucer before him, needed their 
meter.™ In (b), the speaker could have avoided the ambiguity by saying 
“Not all women are gold diggers” if she meant what we think she did, or 
“Each woman is not a gold digger” (or equivalently “No woman is a gold 
digger”) if she intended the meaning which is even more favorable to the 
fair sex. (Here we have a context in which “all” and “each” do not behave 
alike.) Of course, our job in translating is to take the language as it is used 
or misused, and attempt to supply the logical expression that fits best. 

Because the meaning of “any” depends on the context, we left “any” 
out of the preceding lists (except for ~t3xA(x) when “There exists” or 
“There is” indicates clearly the existential quantifier). When an any- 
expression stands by itself, “any” has the same logical force as “all”. 

VxA(x) For any x, A(x). 

Anyone is A. Anybody is A. Anything is A. 

But when an any-expression D is put into either of the contexts ~jD or 
D 3 E (cf. preceding Theorem 24), the meaning of “any” normally alters 
from “all” to “some”, thus. 

-i3.\A(x) Not for any x, A(x). 

For no x, A(x) [no s not any]. 

No one is A. Nobody is A. Nothing is A. 

3xS(x) 3 P If S(x) for any x, then P. 

If anyone is S, then P. 

S 3 VxP(x) If S, then, for any x, P. 

Here are a few specific examples, “J would do that for anyone”: VxA(x). 
But “I wouldn’t do that to anyone”: ~i3xA(x). “If any man is godfearing, 
he is just”: Vx[S(x) 3 P(x)]. But “If any man is just, Aristides is just”: 

” Example from Ambrose and Lazerowitz 1 948 p. 236. 

*“ To avoid this ambiguity, we are hyphenating "not-P” when we write “All S are 
not-P” intending Vx[S(x) 3 — |P(x)]. 

,s Bartlett, “Familiar Quotations”, 13th ed. 1955, p. 77b says that Tyrwhitt says that 
Chaucer took it from the “Parabolae” of Alanus de Insulis (d. 1294): “Non teneas 
aururn totum quod splendet ut aurum (Do not hold everything as gold which shines 
as gold)”. Here it is unambiguously — iVx[S(x) 3 P(x)]. 
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3xS(x) 3 P (here the any-expression is just the antecedent of the implica- 
tion, not the whole implication). “If Vidkun Quisling was a patriot, then 
any man is a patriot”: S 3 VxP(x) (here the any-expression is the conse- 
quent of the implication, which does not give “any” the force of “some”). 
There is a simple explanation of the rule by which “any” alters from “all” 
to “some” in the D of ~ iD or D 3 E but not of E 3 D. “Any” before “x”, 
“one”, “body”, “thing”, etc. shall indicate that the “x”, “one”, ... is 
chosen arbitrarily (in “any” way) from the domain D. But this choice is 
to be made for the sentence as a whole, i.e. treating the “x”, “one”, ... as 
a free variable under the generality interpretation ((II) in §20). This 
amounts to using “any” like “all”, “every” and “each” except that for 
“any” the quantifier Vx is placed at the beginning of a composite sentence, 
rather than where the “any” actually appears, thus: 

VxA(x) For any x, A(x). 

Vx[S(x) 3 P(x)] Any S is P. 

Vx“iA(x) Not for any x, A(x). 

Vx[S(x) 3 P] If S(x) for any x, then P. 

Vx[S 3 P(x)] If S, then, for any x, P(x). 

Now when we move the Vx inward if possible in the composite expressions 
by *82a, *96 or *95, we get the results given by the rule. It may appear to 
be an exception that “Any S is P” is Vx[S(x) 3 P(x)] ; but “Not any S is P” 
and “No S is P” are ~i3x[S(x) & P(x)] rather than ->3x[S(x) 3 P(x)]. But 
if we use a variable £ over 5 to write “Not any S is P” as V^-iP(^), and 
then use the rule, we get ~i3x[S(x) & P(x)] upon changing 3$ to 
3x[S(x) &...]. Clearly “any” is tricky. Not every user can be counted on 
to follow the above principles invariably, so the translator must be on 
guard. 

The indefinite article “a” or “an” sometimes has the force of “all”, 
sometimes of “some”. “A child needs affection”: Vx[C(x) 3 A(x)]. “A man 
was here”: 3x[M(x) & H(x)]. (Cf. end § 26.) 

The scopes of quantifiers in relation to implications are usually clear. 
“For all x, if S(x), then P(x)” and “For all x, S(x) only if P(x)” are 
Vx[S(x) 3 P(x)]. “If, for all x, S(x), then P(x)” and “P(x) if, for all x, S(x)” 
are VxS(x) 3 P(x). 

Here are a few examples of translating quantifiers with & or V. 

3x[S(x) & P(x)] Someone is S and P. 

3xS(x) & 3xP(x) Someone is S and someone is P. 

Vx[S(x) V P(x)] Everyone is S or P. All except S are P. 

VxS(x) V VxP(x) Everyone is S or everyone is P. 
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“No cats or dogs are allowed in apartments” is clearly 
— 1 3 x [( C'( x ) V D(x)) & A(x)] or equivalently Vx[C(x) V D(x) 3 ~iA(x)]. 
But "All cats and dogs must be licensed” would be mistranslated as 
V\[C(x) & D(x) 3 L(x)] (the set f[C(.c) & /)(.»)] is empty); the correct 
translation is Vx[C(x) 3 L(x)] & Vx[D(x) 3 L(x)], or equivalently 
Vx[C(x) V D(x) 3 L(x)] ( *87 and Exercise 13.8). 

Sometimes “all” is used not as a quantifier but to describe a set. “All 
the kings horses and all the kings men could not put Humpty Dumpty 
together again” presumably does not mean simply that no one of the 
horses and men working alone could effect the repair, as the translation 
Vx[H(x) V M(x) 3 — iP(x)] or equivalently 

Vx[H(x) 3 ~iP(x)] & Vx[M(x) 3 — iP(x)] would say. Similar usage occurs 
with “and”. 100 “John and William couldn’t push the car out of the ditch” 
can be translated — iP({j, w}) where P(x) is “x could push the car out”, or in 
strictly predicate calculus symbolism — »P(t) where t stands for the team 
John + William. But “John and William will pass this course” is 
P(j) & P(w) where P(x) is “x will pass this course”;- John and William are 
not supposed to collaborate in writing the final examination. — 

Our discussion early in this section of picking the domain D led us 
naturally to the consideration of subsets of D such as xS(x) and xP(x), 
and thence to the Aristotelian categorical forms and how to translate into 
them. But after picking D, and before introducing quantifiers and propo- 
sitional connectives, we must pick the basic predicates and select ions or 
prime predicate expressions to stand for them. 

Example 21. “When I am tired and hungry, 1 want to go home. Now 
1 am tired and hungry. So I want to go home now.” Let E(x) be “at time x, 
I am tired and hungry”, H(x) be “at time x, I want to go home” and n be 
“now”. 

(1) Vx[E(x) 3 H(x)], E(n) .*. H(n). 

Valid, as already noted in Examples 19 and 20. 

Example 22. “When I am tired, I want to go home. When I am hungry, 
1 want to go home or to a restaurant. Now I am tired and hungry. So I 
want to go home now.” 

(1) Vx[T(x) 3 H(x)], Vx[Hu(x) 3 H(x) V R(x)], T(n) & Hu(n) .’. H(n). 

Again valid. The second premise is redundant, i.e. the argument is valid 
without it. 

'** 1 he preceding anomaly about “cats and dogs" is connected with the use (some- 
times) of "and” to describe a union of sets : (cats and dogs) - i(C(x) V D{x)) = C 'J D. 

In the next example (John and William) we could use instead the translation P(j, w) 
with a binary predicate, but then we would need a ternary predicate for “John, William 
and Henry couldn’t push the car out”. 
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In Example 21, it sufficed to represent the compound notion “tired and 
hungry” simply by an ion E(x), since the argument was valid without further 
analysis. In Example 22, this wouldn’t have sufficed. Example 21 illustrates 
that to confirm the validity of an argument, it may not be necessary to 
analyze it to the maximum possible extent. On the other hand, to establish 
the invalidity of an argument in a given system (in Chapter I, the proposi- 
tional calculus; here, the predicate calculus), we must carry out the fullest 
analysis possible in the system. (We first remarked this in § 3 when we 
proved Theorem I and refuted its converse.) 

In retrospect, Example 15 at the end of § 15 might now seem to call 
for predicate letters P(x), Q(x), R(x) and universal quantifiers, with the 
conclusion becoming 

k Vx[P(x) 3 Q(x)] & Vx[-i(Q(x) & R(x)) V P(x)] & Vx-i[R(x) & P(x)] ~ 
Vx[P(x) 3 Q(x)] & Vx[Q(x) 3 -iR(x)]. However, we analyzed it success- 
fully there by thinking of x as naming a fixed (but unspecified) member of 
the club. (Cf. Exercise 17.6.) 

Example 23. Consider the Aristotelian syllogism “Barbara”: “All 
M are P. All S are M. Therefore all S are P.” Using the Aristotelian A 
form, this becomes: 

(1) Vx[M(x) 3 P(x)], Vx[S(x) 3 M(x)] Vx[S(x) 3 P(x)]. 

Its validity means 

(2) Vx[M(x) 3 P(x)], Vx[S(x) 3 M(x)] k Vx[S(x) 3 P(x)]. 

But simply by *2 in Theorem 2 h and modus ponens, 

(3a) M(x) 3 P(x), S(x) 3 M(x) 1- S(x) 3 P(x) 

in the propositional calculus. Thence using V-elims. and V-introd., 

(3) Vx[M(x) 3 P(x)], Vx[S(x) 3 M(x)] h Vx[S(x) 3 P(x)]. 

More essential use is made of the predicate calculus with quantification 
of one-pla ce predic a te s in Example 19 and th e syllogisms Baroco, Festino, 

etc. (Exercise 24.5). ~~ — 

Now we give an example in which two-plac e predicates are essential. 
This example is beyond the resources of the traditktmrFAristotelian logic, 
which concerned itself with the syllogisms and related matters, which we 
have expressed in terms of one-place predicates. 

Example 24. “The relation x < y is both transitive and irreflexive. 
Therefore it is asymmetric.” In our symbols, 

(1) VxVyVz(x<y & y<z 3 x<z), Vx~ix<x VxVy(x<y 3 -iy<x), 
or equivalently with free variables in the formulas (imitating the notation 
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of mathematics text books, but making the generality interpretation of 
x, y, z explicit by our superscripts) 

(1') x<y & y<z 3 x<z, ~ix<x ,\ xyz x<y 3 ~iy<x. 

To establish the validity of (1) and (I'), it will suffice (by Theorem 12,, d 
and beginning § 23) to establish 

(3') x<y & y<z 3 x<z, — ix<x I- Xyz x<y 3 ~iy<x 

(where “x<y & y<z 3 x<z, ~ix<x f xyz ” means _ 

"VxVyV/.(x<y & y<z 3 x<z), Vx~ix<x h”). We do this in the format 
(B,) of §§13, 25. Assume VxVyVz(x<y & y<z 3 x<z), Vx”ix<x. By 
V-elims., x<y&y<x3x<x and ~ix<x, whence by propositional cal- 
culus x<y 3 ~iy<x. In this example the original notation “<” is concise 
enough so we use it directly. 

Exi.KCtSLS. 27.1. Translate each of the following arguments into 
logical symbolism, and establish the validity or invalidity of the result. 

(a) Everyone loves himself. Therefore someone is loved by somebody. 

(b) Everybody loves Jane. Therefore everybody is loved by someone. 

(c) No animals are immortal. All cats are animals. Therefore some cats 
are not immortal. 

(d) Only birds have feathers. No mammal is a bird. Therefore each 
mammal is fealherless. 

(e) Some students are studious. No student is unqualified. Therefore some 
unqualified students are not studious. 

(f) Each politician is a showman. Some showmen are insincere. Therefore 
some politicians are insincere. 

(g) Nothing effective is easy. Something easy is popular. Therefore some- 
thing popular is not effective. 

(h) All of his friends are devoted. Some of his friends are insincere. No 
insincere person can be devoted. Therefore all of his friends are crooks. 

(i) There isn’t anyone who would believe that. Therefore thejudge wouldn’t 

believe it. 

(j) Any fool could do that. 1 cannot do that. Therefore I am not a fool. 

(k) If anyone can solve this problem, some mathematician can solve it. 
Cabot is a mathematician and cannot solve the problem. Therefore the 
problem cannot be solved. 

(!) Any mathematician can solve this problem if anyone can. Cabot is a 
mathematician and cannot solve the problem. Therefore the problem 
cannot be solved. 

(m) Anyone who can solve this problem is a mathematician. Cabot cannot 
solve this problem. Therefore Cabot is not a mathematician. 
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(n) Anyone who can solve this problem is a mathematician. No mathema- 
tician can solve this problem. Therefore the problem cannot be solved. 

(o) If any of the numbers (strictly) between 1 and 101 divides 101, then a 
prime number less than 11 divides 101. Each prime number less than 11 
does not divide 101. Therefore no number between 1 and 101 divides 101. 

(p) The person responsible for this rumor must be both clever and un- 
principled. Cabot is not clever. Lowell is not unprincipled. Therefore 
neither Lowell nor Cabot is responsible. 

(q) No one can interpret this message unless someone can supply the code. 
Therefore there is someone who can interpret this message only if he can 
supply the code. 

(r) Every liberal advocates changes. Some conservatives favor no one who 
advocates changes. Therefore some conservatives favor no liberal. 

(s) A man is an animal. Therefore the head of a man is the head of an 
animal. (Let M(x), A(x), H(y, x) be “x is a man”, “x is an animal”, “y is 
the head of x”, respectively.) 101 

(t) One person being the father of another is an asymmetric relation; i.e. 
for any persons x and y, if x is the father of y, then y is not the father of x. 
Therefore, no person is his own father. 

(u) * If each of two persons is related to a third person, the first person is 
related to the second. Everyone is related to someone. Therefore, if John 
is related to William, and William is related to Edith, then John is related 
to Edith. 102 

(v) For any persons x and y, x is brother of y if and only if x and y are 
male and x is a different person than y and x and y have the same two 
parents. Therefore, if x is brother of y, then y is brother of x. 

(w) Hope is not lost. Therefore all is not lost. 

27.2. In the following enthymemes (premise(s) or conclusion missing, 
§ 15), attempt to supply what is missing to make a valid argument. Is the 
result sound ? 

(a) Only the brave deserve the fair. She is fair. He is not brave. 

(b) Adults are admitted only if accompanied by a child. 1 am admitted. 
Therefore I am either a child or accompanied by a child. 

(c) John is taller than Susan. Susan is taller than Peter. Therefore John 
is taller than Peter. 

(d) San Francisco is west of New York. Tokyo is west of San Francisco. 
London is west of Tokyo. Therefore London is west of New York. 

101 Given by DeMorgan 1847 p. 114 as an inference not possible by Aristotelian 
syllogistic. 

102 The solution is essentially the same as of Exercise 29.3 (f). 
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*§28. Functions, terms. In this chapter we extend the object 
language (or increase the part of it we take into account) in two ways. 
These extensions can be made separately or simultaneously. 

First (in this section), we add expressions for functions whose values 
are in the same domain D as the arguments , i.e. values of the independent 
variablc(s), in contrast to the predicates (i.e. propositional functions), 
whose values arc propositions (cf. Footnote 51 § 16). Such “functions” 
play an important role in mathematics. For example, when D is the natural 
numbers (0, I, 2, . . .}, we use as such functions x+ 1, x+y, xy, x v , x], etc. 
When D is the real numbers, we use also x—y, Vx 2 + y 2 , sin a;, e x , etc. 
We include individuals (i.e. members of D) as functions of zero variables 
(0-place functions), such as 0 and 1 when D is the natural numbers, and 
also 7t and e when D is the real numbers (just as we included propositions 
as 0-place predicates). 

To make this extension, we assume that the object language has expres- 
sions for such functions, which expressions shall retain their identity but 
not be analyzed. These expressions we call prime function expressions or 
mesons™' and we denote them by “f”, “f(— )”, “f(— , — )”, “f(— , — , — )”, 

• • . “g”, "g( -)”> “g(— — “g(— , — , — )” Putting variables into 

these, we have name forms for them, or prime function expressions (or 
mesons) with attached variables. We can be brief here, as the treatment 
parallels that of prime predicate expressions in § 16. 

Now we describe the larger class of expressions, called “terms”, which 
name functions (using variables, except in the case of individuals). 105 The 
terms shall comprise exactly the variables and the additional terms obtain- 
able thus: For each n > 0, each «-place meson f( — , . . . , — ), and each 

103 The predicate calculus with equality is an important chapter of logic. But what will 
be needed in Part II of the book will be repeated or cited piecemeal. So the student who 
has not time for both can skip this chapter. 

We pick the short name “meson” (akin to “atom”, “molecule”, “ion”), without 
making any pretense that there is an analogy with chemistry or physics. 

lu5 When the variables are interpreted as expressing members of D, the formulas 
express propositions, 53 and the terms express members of D. However, we have nothing 
here for terms analogous to the generality interpretation of free variables x,, . . . , x„ in 
formulas, by which a formula A can instead express the proposition expressed by 
V'A (§ 20). 
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list r„ . . . , r„ of n terms r 1; . . . , r„ already constructed, f(r x , . . . , r„) shall 
also be a term. 

Now we take over the definition of formula in § 16, but allowing the 
r„ . . . , r„ in the definition of “prime formula” or “atom” to be any 
terms, instead of simply any variables. (If there are no mesons, the notion . 
of “term” just defined reduces to “variable”, so the present notion of 
“formula” reduces to the one in § 16.) 

For each choice of a (nonempty) domain Z>, each term has a table with 
values in D, and each formula has a truth table. The tables must be entered 
with all //-place functions with arguments and values in D as value of each 
//-place meson (for // = 0, simply with all members of D). 

Example I. We give the tables for D =■ {1, 2} of the term f(f(x)) and 
of the formula Vx[P(f(f(x))) O -iP(f(x))]. First, we list all possible 1 -place 
functions with values in D (and repeat from § 17 Example 1 the list of 


1 -place logical functions). 64 

x /i(x) / 2 (x) / 3 (x) / 4 (x) I,(x) L(x) l 3 (x) I 4 (x) 



1 


1 1 

2 

2 t 

t f f 


2 


1 2 

1 

2 t 

f * f 


f(x) 

X 

f(f(x)) 


P(x) f(x) 

Vx[P(f(f(x))) 3 — iP(f(x))] 

1. 

/i(x) 

1 

1 

1. 

U(x) /i(x) 

f 

2. 

/l(x) 

2 

1 

2. 

l.(x) /*(x) 

f 

3. 

/»(*) 

1 

1 

3. 

ll(x) /,(x) 

f 

4. 

/a(x) 

2 

2 

4. 

ll(x) / 4 (x) 

f 

5. 

/aW 

1 

1 

5. 

I 2 (x) /,(X) 

f 

6. 

77 *) 

2 

2 

6. 

U(X) /,(X) 

f 

7. 

f*W 

1 

2 

7. 

L(x) / 3 (x) 

t 

8. 

/ 4 (x) 

2 

2 

8. 

l 2 (x) /i(x) 

t 





9. 

I 3 (x) /i(x) 

t 





10. 

U(x) /i(x) 

f 





11. 

la(x) /a(x) 

t 





12. 

Ia(x) / 4 (x) 

f 





13. 

I 4 (x) /l(x) 

t 





14. 

I 4 (x) / 2 (x) 

t 





15. 

l 4 (x) /a(x) 

t 





16. 

l 4 (x) / 4 (X) 

t 

Here is the computation of Line 5 of the table 

for f(f(x)) (left): 






f(f(x)) 







/a(/a(D) 







/a(2 ) 

1 
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Wc now compute the entry for Line 7 of the table for the formula (right). 
This requires us to compute the following supplementary table. 

x W/aC/sto)) => 


1 

t 

2 

t 


Here are the computations for this supplementary table. 108 


I 2 (l )=>-iL(2 ) 

t 3 "if 

t 3 t 

t 


k(UM2)}) => -il a (/ 3 (2)) 

I*(2 )3“il,(l ) 

f 3 "it 

f => f 

t 


Since the supplementary table has a solid column of t’s, 

Vx[P(f(f(x))) 3 — iP(f(x))] is t in Line 7. 

The definitions of valid and of valid consequence (again to be abbre- 
viated using “t=”) are as before (also of “5-l=”). 

A little consideration shows that all of the model-theoretic results stated 
above for the predicate calculus remain good under this extension of the 
language, where now in Theorem 15 r can be any term (not just any 
variable) which is free for x in A(x) in the extension of the earlier notion 
(§ 18) obtained by calling an occurrence of a term in a formula free if each 
variable occurrence in it is free in the formula. Thus, briefly, r is free for 
x in A(x), if no (free) occurrence of a variable in r becomes bound as a 
result of substituting r for the free x’s in A(x). Similarly, in the definitions 
preceding Theorem 17, and in Theorem 18, r x , . . . , r Pi or r x , . . . , r„ can 
be any terms under the conditions stated there for variables. 

The proof theory of the predicate calculus with functions differs only 
by our allowing the r in the V-schema and the 3-schema to be any term 
free for x in A(x). The corresponding changes can then be made in V- 
elimination and 3-introduction in Theorem 21 and in its Corollaries. 

Now that we are allowing 0-place mesons to name individuals (i.e. 
members of D), we can use these for “Jane” in (a)-(c) beginning § 16, and 
for “Socrates” and “five” in Examples 19 and 20 § 26. The only difference 
this makes from using variables (held constant for the valid consequence 
and deducibility relationships) as we did in Chapter II is that there we 
might have used the same variables also in bound occurrences. That would 
make no trouble for our logical analyses, though it seems unnatural since 
in ordinary languages we normally use proper nouns etc. for such purposes. 

iog \y e are sav | n g a s t C p here by using the table for f(f(x)) (above left). But of course 
it is not necessary first to set up tables for terms occurring in our formula. 
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The substantial gain in using functions comes when functions of n > 0 
variables are employed in conjunction with the equality predicate (§§29, 30). 

Exercises. 28.1. How many lines does the table have when D = {1, 2}? 
Compute the indicated line, 

(a) g(f(g(x, f(h))), x) when f(x), g(x, y), h, x are/ 4 (x),/ 7 (x, y), 1, 2, respec- 
tively (where /,( 1, 1) =/,( 2, 2) = 1,/ 7 (1, 2) =/,( 2, 1) = 2). 

(b) VxP(g(f(g(x, f(h))), x)) V Q, with the above assignment extended by 
l 2 (x), f to P(x), Q. 

28.2. Which of the following terms are free for x in the formula 
Vw(P(x,y)V3zQ(y,x)=>R(w))? (a) x. (b) g(z, f(x, y)). (c) f(x, y). (d) 
g(w, y). (e) g(y, f(h, x)). 

28.3. Fix the proof of Theorem 15 (b) to apply here. 

* § 29. Equality. The predicate calculus with equality (or with identity) 
can be described as arising from the predicate calculus (Chapter II or 
§ 28) by giving one of the 2-place ions a special treatment. We write this 
ion as “ — = — ” and read it “ — equals — ”. 

For our model theory, we evaluate x=y by the logical function I(x, y) 
whose value is t when x and y have the same value, and f otherwise (e.g. 
when D = {1, 2}, by l 7 (x, y) Example 3 § 17). So in the predicate calculus 
with equality, the truth tables are not entered with a value of x=y, as its 
value is fixed in advance (as are the values of the logical symbols Z5, &, 
V, “i, V, 3). In the predicate calculus with equality, by a proper ion we 
shall mean an ion other than — =— . 

Example 2. Here is the truth table for D — (1, 2} of 
Vx[P(f(x)) V 3y x = f(f(y))] (written in two columns to save space). 

Vx[P(f(x))V3yx = f(f(y))] 



P(x) 

f(x) 


P(x) 

f(x) 


1 . 

li(x) 

/l(x) 

t 9. 

I 3 (x) 

/l(x) 

f 

2. 

liW 

/ 2 (x) 

t 10. 

Wx) 

/ 2 (x) 

t 

3. 

h(x) 

A(x) 

t 11. 

I 3 (x) 

/ 3 (x) 

t 

4. 

ll(x) 

m 

t 12. 

I 3 (x) 

/ 4 (x) 

t 

5. 

I 2 (x) 

/l(x) 

t 13. 

l 4 (x) 

m 

f 

6. 

l 2 (x) 

/ 2 (x) 

t 14. 

I«(x) 

/ 2 (x) 

t 

7. 

( 2 (x) 

/ 9 (X) 

t 15. 

I 4 (x) 

/ 3 (x) 

t 

8. 

i 2 (x) 

/ 4 (x) 

f 16. 

I 4 (x) 

/ 4 (x) 

f 


Here is the computation of Line 7. 

Vx[P(f(x))V3yx=f(f(y))] 
Vx[l 2 (/ 3 (x))V3yx=/,(/ 3 (y))] 
t 


(i) 

(ii) 
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To get (ii), we need the following supplementary table. 


x I a (/.(x))V3yx =/*(/, (y)) 



The two computations for this are as follows (explained below). 


!*(/»(!)) V3y 1 

=/.(/.( y)) 

I a (/s(2)) V 3y 2 

=/»(/ .(y)) 

1,(2 )V t 


1.(1 

)V t 


f v t 


t 

V t 


t 



t 


To get the t used here in the second line of each column, 

we need two more 

supplementary tables: 





y i 

=/a(/s(y)) 


y 

2=/ a (/,(y» 

(bi) j 

t 

(b.) 

i 

f 

2 

f 


2 

t 

Here are the computations for these. 




1 =/»(/»( 0) 

\=HW)) 

2= 

■-mm 

2=/ a (/.(2)) 

1= 1 

1 = 2 

2 = 

= i 

2=2 

t 

f 

f 

t 


The last step in each of these four computations is by the rule for evalu- 
ating = . Since each of (bj) and (b 2 ) has a t, we get the t in the second lines 
of the two computations for (a) by the rule for evaluating 3. Then by the 
rule for V we get (ii). 

Example 3. Here are the tables for 3xVy(P(y) 3 x=y) and 
3x[P(x) & Vy(P(y) D x=y)] when D = {1,2, 3}. First we list the possible 
values of P(x) (as in § 17 Example 4). 

x I,(x) l,(x) I 3 (x) l 4 (x) l 5 (x) I a (x) l 7 (x) I 8 (x) 

1 t t t t f f f f 

2 ttfftt^ff 

3 tftftf'tf 
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(a) (b) 

P(x) 3xVy(P(y) 3 x=y) 3x[P(x) & Vy(Pfy) 3 x=y)] 


«*) 

f 

f 

4(x) 

f 

f 

4(x) 

f 

f 

l 4 (x) 

t 

t 

«*) 

f 

f 

I«(x) 

t 

t 

l 7 (x) 

t 

t 

h(x) 

t 

f 


We now compute Line 6 for (a) and (b). We shall need the following 
supplementary tables. 

(a') (b') 

x Vy(( 6 (y) 3 x=y) l 6 (x) & Vy(l 6 (y) 3 x=y) 


1 f f 

2 t t 

3 f f 

To compute these we need in turn three supplementary tables: 

(al') (a?) (a") 

y 4(y)3 1=y l a (y)32=y l G (y) 3 3=y 


It t t 

2 f t f 

3 t t t 

Here are the computations for (a'/) (and those for (a£), (aj) are similar): 


1 6 (1) 3 1 = 1 ( 6 (2) 3 1=2 I 6 (3) 3 1=3 

f ^ t t 3 f f 3 f 

t f t 

Using the tables (ai)-(a!,')> the evaluation rule for V gives the values shown 
above in (a'); and those in (b') are the same thus: 

l s (l) & Vy(I 6 (y) 3 1 =y) 1,(2) & Vy(l 6 (y) 3 2=y) ! 6 (3) & Vy(l 6 (y) 3 3=y) 
f & f t & t f & f 

f t f 


Since (a') and (b') each have a t, the entry in Line 6 for (a) and (b) is t. 
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Now we note that in Table (a) 3xVy(P(y) 3 x=y) is t for exactly those 
l(x)’s which have at most one t; and in Table (b) 

3x[P(x) & Vy(P(y) 3 x=y)] is t for exactly those l(x)’s which have exactly 
one t. The student should be able to convince himself that this is the case 
not just for D = {I, 2, 3} but for any (nonempty) D. Thus 
3xVy(P(y) 3 x=y) expresses “there is at most one x such that P(x)” and 
3x[P(x) & Vy(P(y) 3 x=y)] expresses “there is exactly one x such that 
P(x)” or “there exists a unique x such that P(x)”. 

The latter notion is frequently enough used so that we adopt “3!xP(x)” 
as an abbreviation for 3x[P(x) & Vy(P(y) 3 x=y)]. More generally, 
“3!xA(x)” abbreviates 3x[A(x) & Vy(A(y) 3 x=y)], Vhere in unabbrevia- 
ting we pick for y a variable free for x in A(x) and distinct from x and the 
free variables of A(x). (Each two legitimate unabbreviations of “3!xA(x)” 
are congruent, end § 16.) Another useful abbreviation is “x?£y” for 
— ix=y, or more generally “r?*s” for ~ ir=s where r and s are any terms. 

The results stated above in the model theory of the predicate calculus 
remain good in the predicate calculus with equality. (Now in § 19, (I) is 
to be a list of only distinct proper ions including all occurring in E.) We 
also have further results, which we collect in the following theorem. 

Theorem 28. (a) k x=x. (b) k x=y 3 (x— z 3 y=z). 

(Open equality axioms for =.) 

(cj) k x=y 3 (P(x, a 2 ) 3 P(y, a 2 )). (q) 1= x = y 3 (P^, x) 3 P( ai , y)). 

(Open equality axioms for a proper ion P(a l5 a*).) 

(d'j) k x—y 3 f(x, a,)=f(y, a 2 ). (d 2 ) k x=y 3 f(a lt x)=f(aj, y). 

(Open equality axioms for a meson ffaj, a 2 ).) 

Similarly, for each n , there are n “ open equality axioms" (c")> • • • , (c") for 
each proper n-place ion P(a 1; . . . , a„), and n of them (dj')» • • • > (d") for each 
it-p/ace meson f(a,, . . . , a„). 

Proofs, (c,). By Corollary Theorem 8p, )=1 it will suffice to show that 
x = y, P(x, a 2 ) k P(y, a 2 ). So consider any D and any assignment which 
makes x=y and P(x, a 2 ) both t. Since x=y is t, the value of y in the assign- 
ment is the same as that of x; so, since P(x, a 2 ) is t,-so is P(y, a 2 ). — 

To establish the proof theory of the predicate calculus with equality, 
we start with the axiom schemata and rules of inference we already have 
(§21, or if functions are allowed §28), and add as further axioms the 
formulas shown to be valid in Theorem 28, as we anticipated doing when 
we called them “open equality axioms”. (The “closed equality axioms” 
are the closures of the open equality axioms, § 20.) Then by Theorem 28, 
Theorem 12 and Corollary continue to hold. 
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Now the provability and deducibility results already established for the 
predicate calculus all hold. This is obvious in the case of the direct rules 
(as in going from the propositional to the predicate calculus, § 21). More- 
over, in the proof of the deduction theorem (§§ 10, 22), applications of 
the new axioms can be handled under the old Case 3. Then we also have 
the subsidiary deduction rules based on the deduction theorem which are 
included among the introduction and elimination rules of Theorems 13 
and 21 . 


Theorem 29. (a) l-x=x. (e) b x=y 3 y =x. (f) h x=y & y=z 3 x=z. 
(Reflexive, symmetric and transitive properties of equality.) 

For any terms r, s, t, t x , t 2 , etc. : 

(gj) r=s br=t~s=t. (g 2 ) r=s ht=r~t=s. 

(h*) r=s b P(r, t 2 ) ~ P(s, t 2 ). (hi) r=s b P(t„ r) ~ P(t 1; s). 

(ij) r=s b f(r, t 2 )=f(s, t 2 ). (if) r=s b f(t„ r)=f(t x , s). 

Similarly, for each n > 0, we have ti such results (h”), . . . , (h*) for each 
proper n-place ion, and n of them (if), . . . , (i”) for each n-place meson. 

(Special replacement results.) 

Proofs. (For the format (B x ) which we use, cf. §§ 13, 25.) (e) Assume 
x=y. By (a), x=x. Substituting x for z in (b) (by Theorem 21 Corollary 
2(c) with F empty), x=y3(x=x3y=x), Using 3-elim. t.wice, y=x. 

(h^) Assume r=s. I. Assume P(r, t 2 ). By substituting r, s, t 2 for x, y, a 2 
in (cj), r=sD (P(r, t 2 ) 3 P(s, t 2 )). By 3-elim. twice, P(s, t 2 ). II. Assume 
P(s, t 2 ). By substitution in (e), r=s3 s=r. By 3-elim., s=r. By substitu- 
tion in (cl), s=r 3 (P(s, t 2 ) 3 P(r, t 2 )). By 3-elim. twice, P(r, t 2 ). 

Theorem 30. (Replacement theorem.) (1) Let r and s be any terms, t, 
be a term containing r as a specified ( consecutive ) part, and t s be the result 
of replacing that part by s. Then : 

r=s b t r =t 8 . 

(II) With r and s as in (I), let C r be a formula containing r as a specified 
(consecutive) part ( not the variable in a quantifier), C 8 be the result of 
replacing that part by s, and x x , ... ,\ Q be the variables of r or s which 
belong to quantifiers in C r having the specified part within their scopes. Let 
T be a list of ( zero or more) formulas not containing any of x lt . . . , x q free. 
Then : 

IfT b r=s, then T HC r ~ C 8 . 


Proof, as illustrated next. 
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Example 4. In the following, each of 1-3 after the first is deducible 
from the preceding, by (ij), (ij;). 

1. f(x, y) = g(x) 

2. h(f(x, y)) = h(g(x)) 

3. f(z, h(f(x, y))) = f(z, h(g(x))) 

4. f(z, h(f(x, y))) = g(z) ~ f(z, h(g(x))) = g(z) 

5. Vz f(z, h(f(x, y))) = g(z)~ Vz f(z, h(g(x)))=g(z) 

6. 3yVz f(z, h(f(x, y))) = g(z) ~ 3yVz f(z, h(g(x)))=g(z) 

Thus f(x, y) = g(x) b f(z, h(f(x, y))) = f(z, h(g(x))), as (I) asserts. 

Each of 1-6 is successively recognizable as being deducible from 
Vy f(x, y)=g(x), by V-elim., (ij), (ij;), (g,), *71a (since z does not occur 
free in Vy f(x, y) = g(x), *72a (since y is not free in Vy f(x, y) = g(x)). Thus 
Vy f(x, y) = g(x) b 3yVz f(z, h(f(x, y)))=g(z) ~ 3yVz f(z, h(g(x)))=g(z), 
as (II) asserts. 

Coroi laky I . For Cr, C. s , Xj, . . . , x„ as in (II) of the theorem: 
r=s b x «- x « C r ~ C B . 

Corollary 2. (Replacement property of equality.) Similarly: 

(a) 1J V br=s, then F, C r b C s . (b) r = s, C r b*f •• x " C s . 

Also (a)-(^) of § 5 hold now with “F b” replacing “b” and “=” 
replacing and (except in (a)) terms r, s, t, r 0 , r a , r 2 , . . . in place of 
formulas A, B, C, A 0 , A lt A 2 , . . .. Hence the chain method applies to 
equalities (as well as to equivalences) in the predicate calculus with equality. 
Its usefulness will be illustrated in §§ 38, 39. 

Tiii.oki.m 31. If b E, then there is a proof of E in which there occur no 
proper ions, anil no mesons, not occurring in E. 

Proof. Consider a given proof of E. In this there may occur proper 
ions, or mesons, not occurring in E. Take the atomic parts (or wholes) of 
formulas in this proof formed using such ions, and replace them all by 
Vxx=x. Then take the terms occurring as parts of formulas in the resulting 
figure which have such mesons as their outermost symbols and are max- 
imal, i.e. not parts of other such terms, and replace them all by the same 
variable v not occurring in the given proof. These two operations do not 
alter the end formula E or any equality axioms for = or proper ions or 
mesons of E, and they do not spoil any axiom or inference of the predicate 
calculus (cf. the 2nd paragraph of Footnote 86 § 25). However, open 
equality axioms for ions replaced are altered tox=y 3 (Vxx=x 3 Vxx=x), 
and for mesons replaced to x=y 3 v = v. But each of these is provable, 
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using *1 or (a), and Axiom Schema la. Replacing them by their proofs in 
the figure last obtained, we have a proof of E using only the proper ions 
and the mesons occurring in E. 

Corollary. I- E in the predicate calculus with equality, if and only if 
Qt» ■ • ■ > Q s E in the predicate calculus, where Q 0 , . . . , Q s are the dosed 
equality axioms for = and the proper ions and the mesons occurring in E. 107 

Proof. If. Using V-introds. to prove the closed equality axioms 
Q 0 , . . . , Q, from the respective open ones in the predicate calculus with 
equality. Only if. Using V-elims. to deduce the open equality axioms 
from the closed in the predicate calculus. 

Exercises. 29.1. Do the computation for Line 9 in Example 2. 

29.2. Write formulas in the predicate calculus with equality to express 
each of the following. 

(a) There are at most two x’s such that P(x). 

(h) There are exactly two x’s such that P(x). 

(c) The number of x's such that P(x) is between 2 and 5. 

29.3. Establish (b) in Theorem 28, and (f) and (g,) in Theorem 29. 
(Hint: for (f), begin with an appropriate substitution into (b).) 

29.4. Give a model-theoretic treatment of replacement under equality, 
paralleling Theorems 19 and 5 Pd (the extension of Theorem 5 in § 19). 

*§ 30. Equality vs. equivalence, extensionality. We have now (in 
both model theory and proof theory) the four basic properties of equality: 
reflexivity ((a) in Theorems 28, 29), symmetry ((e) in Theorem 29), 
transitivity ((f) in Theorem 29) and the replacement property (Corollary 2 
Theorem 30). 108 

The fourth of these properties, replaceability, is sometimes overlooked 
in listing the fundamental properties of equality. Model-theoretically, it 
holds on the basis of our interpretation of “x = y” as meaning that x and y 

107 If E does not contain =, then EE in the predicate calculus with equality if and only 
if V E in the predicate calculus. However, this is not as easy to prove (cf. Exercise 52.3 
and end Footnote 275 § 55). 

108 Using Theorem 12, . d= , the proof-theoretic results imply like results in model 
theory. 

We might have taken these four properties themselves to establish the proof theory. 
In deriving the last three, we have followed the tradition in proof theory of starting with 
as few and simple axioms as possible. This proof-theoretic treatment is due essentially 
to Hilbert and Bernays 1934 pp. 164 ff. 

Another (frequently used) method is to employ the axiom (a) and (instead of our 
other particular equality axioms) the axiom schema x=y D (A(x) 3 A(y)) where z is 
any variable, A(z) is any formula, and x and y are any variables (distinct from each 
other but not necessarily from z) free for z in A(z) (and A(x), A(y) are the results of 
substituting x, y respectively for the free occurrences of z in A(z)). 
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are the same object (Exercise 29.4). We built this interpretation into the 
rule for evaluating x = y in determining truth tables. 

Sometimes “equality” is used in a different sense, so that it possesses 
only the first three properties (refiexivity, symmetry and transitivity) but 
lacks the replacement property (except in special contexts). Whatever the 
terminology and symbolism, it is important to recognize that a relation 
possessing only the first three properties is not “equality” in the above 
sense. Such a relation we prefer to call only “equivalence” or “an equiv- 
alence relation”. 11,a „ 

To take an example, consider the set D of all fractions - where p and q 

1 -1 2 5 17 5 

are integers with q ^ 0. Thus D has as members -, 

— — , etc. Two such fractions - and 
-12 q s 


3 ’ 5’ 10’ r -15’ 
are equal (i.e. they are the same 


fraction) exactly if p = r and q = s (i.e. (/>, q) and (r, s) are the same 
ordered pair of integers). However, we are usually not interested in the 
fractions as fractions but as expressions representing rational numbers. 

Now the different fractions etc. all express the same rational 

number; — etc. express another; etc. This is sometimes described 

by saying that we “define equality” between fractions (assuming multipli- 

p r 

cation and equality of integers already known) by the formula - = - = 

ps = qr. (The student can remember the right member as the result of 
“cross-multiplying” in the left.) 

What objects “ratios” (after Euclid) or “rational numbers” actually 
are probably does not worry the schoolboy or the layman, provided he 
gets the “right” rationals in the “right” relationships to one another. Our 
natural concept of rational numbers seems to be that a rational number is 
something we get by “abstracting” from the fractions forming a set such 

as |l t . . . j, i.e. by disregarding what is different and keeping what 


We are concerned here with equivalence relations between members of the domain 
D. 

We have already been dealing in §§ 2, 4, 5, 15, etc. with an equivalence relation ~ 
(“material equivalence”) between propositions. There we had not only refiexivity, 
symmetry and transitivity (*91-*21 in Theorem 2, or ((})-(§) in § 5), but also replace- 
ability in the formulas considered (Theorems 5 and 23). Nevertheless, we do not con- 
sider the relation ~ to be an equality relation between propositions, since under our 
interpretation different (unequal) propositions can be equivalent (namely, they are 
when they have equal truth values).* 7 
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is common. The role of a theory such as we describe next is to show rigor- 
ously that a system of objects with the desired properties does exist. 

One way to construct these objects is to say that the rational J- is the 

SC1 {[ W T5- ’ ' I -5 is ,h = «• ( t -^ 5 - • ' l ;e,c ' Ano,her way i! 

to pick from each of these classes one particular fraction, say the one in 
lowest terms with positive denominator, so the fraction 1 is a rational 


... 5 -6 

number while — , — — , 


are expressions for the rational The first 


method has become fashionable as part of the current practice in classical 
mathematics of using set-theoretic concepts as the basic vocabulary. We 
now follow this method in giving a rigorous theory of the rational num- 
bers, supposing such a theory of the integers. . . , —3, —2, —1, 0, 1, 2, 
3, . . . already established. 

While we are giving this theory, in order to remove any possibility of 

confusion between a fraction - (as a fraction) and the rational number 

H n 

which it represents, we shall write the fractions - as ordered pairs ( p, q). 

*1 p 

After the theory has been established, we can revert to writing as the 

q 

schoolboy and the practicing mathematician do; the equations one 
ordinarily writes will then read correctly with “ = ” meaning equality 
between rationals by interpreting the fractions as names for the rationals. 

So far we have been tacitly assuming that the set D of the fractions 
falls into disjoint (i.e. nonoverlapping) nonempty classes D v D 2 , D 3 , . . . , 
where say = {(1, 2), (5, 10), ( — 6, — 12), . . .}, 

ZX, = {(—I, 3), (5, —15), . . under the criterion that (p,q) and (r, s) 
belong to the same class exactly if ps = qr in the theory of integers. We 
should justify this. 

First, however, we observe the following: (A) Suppose a nonempty set 
D is partitioned into disjoint nonempty classes. Then the relation “ x and y 
belong to the same one of the classes", or in symbols x~y, is an equivalence 
relation, i.e. ~ is reflexive (x ~ x), symmetric (x~y-+y~x) and 
transitive (x y & y ci « x z). This is so obvious that we can only 
ask the student to reflect on it a moment. 

Now, conversely: (B) If a binary relation ~ with D as its domain is 
an equivalence relation {i.e. if it is reflexive, symmetric and transitive), then 
D is partitioned into disjoint nonempty, classes such that members x and y 
of D belong to the same one of the classes exactly when x ~ y holds. 

Proof. For any member x of D, let x* be the class of all those members 
u of D such that x ~ u. 
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I. Because of the reflexive property of ~ (a- ~ x), x itself belongs to x*. 
Thus each of the classes x* for x in P is nonempty, and each member x 
of D is in at least one of the classes (the classes x* exhaust D). 

II. Now consider any members x and y of D. Case 1 : x~y. We show 
that, in this case, x* and y* are the same class (which by 1 contains both 
x and y)\ i.e. each member u of x* is a member of y*, and vice versa. To 
prove this, first suppose u is a member of x* (in symbols, uex*), i.e. 
x ~ u. From the case assumption x ~ y by the symmetric property of 

y ~ a r; so by the transitive property of y ~ u, i.e. u e y*. Similarly, if 
v e y*, i.e. y ~ v, then by the case hypothesis and the transitivity, x ~ v, 
i.e. v e x*. Case 2: not x ~ y. In this case, x* and y* have no members in 
common. For, if they had a common member z, then x~ z and y a z, 
and by symmetry and transitivity x ~ y, contradicting the case hypothesis. 

When we start with a set D and an equivalence relation ~ on D to 
discover the disjoint subsets of D by (B), we call those subsets the equiv- 
alence classes of D under the relation — . 

In particular, when D is the fractions, we can use (B) to conclude that 
D is partitioned into the disjoint nonempty subsets which we are taking 
as the rationals. Thus, we define ( p , q) cx. ( r , s) = ps = qr, and verify that 
this is an equivalence relation (Exercise 30.1 (a)). Then (B) applies. 

In going from the fractions to the rationals, we substitute for the 
domain D of the fractions the new domain D* of its equivalence classes 
under the equivalence relation just defined. In doing so, the equivalence 
relation ~on O becomes, or gives rise to, the equality relation = on D*\ 
i.e. two rationals x* and y* are the same (** = y*) exactly if, for any 
(p, q) e x* and ( r , s ) e y*, {p, q) ~ ( r , s'). 

The reason why we start with the fractions D to build the rationals D* 
is that the p, q of the fractions (p, q) provide us with'the means of delining 
and performing the operations we wish to perform on the rationals. 


p r 

Thus, we are accustomed to taking the sum - + - by the following 

i Q s 

P r ps qr ps + qr , , . 

manipulation: ’-+- = — + ?-■= (which may be reduced to 

q s qs qs qs 

lowest terms, if we wish). To emphasize that the operation is initially one 
on fractions as such, we make the definition (p, q) + (r, s) = (ps + qr, qs). 
But we want an operation on equivalence classes of fractions, since the 
rationals are those and not the fractions themselves. To confirm that this 


addition of fractions (p, q) + (r, j) does give an operation on rationals 
x * ,y* 5 we mL ist show that the equivalence class to which (ps + qr, qs) 
belongs depends only on the equivalence classes a;* and y* to which (p, q) 


and (/', 5 ) belong. That is, we need to show that 


(p, q) c=L (/>!, </,) & (/•, s) ~ Oy, .v,) — (ps + qr, qs) ~ (p l s l + qp u qp\) 
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(Exercise 30.1 (b)). For then the addition of equivalence classes ** and 
y* is defined with a uniquely determined equivalence class z* as result, 
thus: pick any member (p,q) of x* and any member (r, s) of y*; add by 
the above rule; and take as z* the equivalence class to which the resulting 
fraction (ps+qr, qs) belongs. 

Similarly, we can define the predicate x* < y* between rationals x* 
and y* by first putting (p, q) < (r, s) s (ps < qr if qs > 0; ps > qr if 
qs < 0), and then showing that the truth or falsity of (p, q) < (r, s) 
depends only on the equivalence classes of (p, q) and (r, s) (Exercise 
30.1 (d)). 

Not all operations on fractions as pairs of integers (p, q) with q 0 
depend only on the equivalence classes to which the fractions belong. 
E.g. (p,q) ° (r, s) = (p+r,q+s) is a perfectly well defined operation 
(function) on fractions which does not induce an operation on rationals. 

Equivalence (between members of D), unlike equality, does not in 
general have the replacement property. For example, from 
(p, q) + (r, s) = (/, u) and (p, q) ~ (/>„ q t ) it does not follow that 
( P\t q t ) + ( r , s) = (/, u ) (though (p u q t ) + (r, s) ~ ( t , w) follows). 

To summarize what has been elaborated on the example of the rationals, 
if someone has introduced a collection D of objects, and then says “let us 
define ‘equality’ between these objects”, he is in our terminology pro- 
posing to define an equivalence relation. He is not free to define “equality” ; 
the equality relation between the objects is already fixed simultaneously 
with the introduction of the set D. What he is proposing to define will 
become equality in the new domain D* consisting of the equivalence 
classes of D, or in a layman’s terms of the objects that we can abstract or 
invent by disregarding the differences which in D subsist between members 
of an equivalence class. — 

Example 5. We saw in § 29 that 3x[P(x) & Vy(P(y) D x = y)] expresses 
“there is a unique x such that P(x)” in the model-theory of the predicate 
calculus with equality. This cannot be expressed in the model theory of the 
predicate calculus without equality (Chapter II). To show this, suppose to 
the contrary we had a formula E of the latter with just one ion P( — ) such 
that E is true when and only when P(x) is evaluated by a logical function 
t(x) which is t for exactly one member of D. Then in particular, E is t 
when D = {1, 2, 3} and P(x) is evaluated by l 6 (x) (which has the table f, 
t, f). Now consider instead a D with four members, say D = {1, 2 1; 2 2 , 3}, 
and let P(x) be evaluated by I(x) where 1(1) = f, = L(2 2 ) = t, 
1(3) = f. In the new computation, 2 X and 2 2 behave just as 2 did before, 
so again E will be t. This contradicts our supposition, since this l(x) is 
true for two values of x. 

As Example 5 illustrates, the rules for evaluation in the model theory of 
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the predicate calculus without equality provide no barrier against 
splitting any element of any D into a multiplicity of elements which will 
behave alike in all the computations previously considered. (This process 
is the reverse of coalescing elements into equivalence classes as new 
elements.) — 

Now we discuss problems of translation from ordinary language into 
logical symbols when equality may be involved. The basic presupposition 
for our model-theoretic treatment of equality is that we are dealing with 
some collection or set as the domain D. This we think of as constituted of 
“definite well-distinguished objects”. 110 So if a; is a member of D and y is 
a member of D , then either x and y are the same member ( x—y is true) or 
x and y are different (distinct) members ( x=y is false). In brief, our 
notion of a domain D already entails the equality predicate over D. 

Any application of the predicate calculus, now with equality, is to 
come after we have a set D as the domain, which we have either con- 
structed or assumed to exist on the basis of supposed facts or for the 
purpose of the argument. A domain in our sense may constitute quite a 
sophisticated (or even problematical) abstraction from perceptual 
experience. 

To illustrate, we take the cases of temperature and color. Our sensations 
of each are fuzzy, so that close temperatures and colors intergrade; it is 
often hard to say whether two objects under given conditions have the 
same temperature or color. But physics and psychology have constructed 
mathematical models, in which temperatures are represented “exactly” 
by real numbers (> an “absolute zero”), and colors by points in some 
three-dimensional region (or triples of real numbers subject to certain 
restrictions). In classical mathematics, each two real numbers x and y are 
either equal or unequal, by the following rule: write each real number as 
the sum of an integer (positive, zero or negative) and a nonnegative 
infinite decimal fraction; e.g. n = 3 + .14159. . . = (briefly) 3.14159. . . , 
2/3 = 0.66666. . . ; 75/2 = 37.50000 = 37.49999. . . , 

—3/4 = — 1 + .25000 = — 1 + .24999. . .. Sometimes there will be two 
ways to do this, one using infinitely repeating 0’s and the other infinitely 
repeating 9‘s; in these cases, let us now choose always the same way, say 
the one with repeating 9’s. Now x = y if x and y have the same integer- 
plus-decimal representation; x ^ y otherwise, i.e. if the integers are 
different, or the decimals differ in at least one digit. 111 

110 Cantor 1895 p. 481. Cantor created the theory of sets between 1874 and 1897. 
We deal with this in §§ 32-35 of Chapter IV. 

111 There are other theories of real numbers, equivalent to this but more elegant (for 
one, cf. IM pp. 30-32). Intuitionists (§ 36) do not accept these theories of real numbers; 
cf. Heyting 1934, 1956. 
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If one starts with our notion of a domain D, and forms all possible 
one-place predicates P(x) over D (or simply all possible one-place logical 
functions l(x) over Z)), 87 then x — y can be defined thus: 
x = y — {for all P, P(x) ~ P(y)}. For, if x and y are the same member of 
D, then by our notion of a predicate, for each P, P(x) and P(y) are the 
same proposition, so P(x) P{y). On the other hand, if x and y are 
different members of D, then there are predicates P which “separate” 
them, i.e. such that P(x) is true while P(y) is false or vice versa, so not, 
for all P, P(x) ~ P(y). This definition expresses the so-called principle of 
identity of indiscernibles of Leibniz 1685+ (reproduced in Lewis 1918 pp. 
373-387): if we can discern no property P in which x and y differ, x and y 
are identical. In second-order predicate calculus (references in § 17), we 
can hence regard x=y as an abbreviation for VP(P(x) ~ P(y)), instead of 
introducing it as a primitive predicate. Since conceptually the idea of 
equality underlies our notion of a domain and of predicates over the 
domain, it seems more elementary and direct to introduce equality as we 
do than to define it by reference to all predicates. Of course, the Leibniz 
definition would only be available to us if we were considering second- 
order logic. 

For the Leibniz definition to seem a natural way to introduce the idea 
of equality (i.e. a way which explains its meaning rather than simply 
states a necessary and sufficient condition for what is already understood), 
one must start from a rather different position than in our model theory. 
For, it requires one to entertain the applicability of properties P to 
objects x, or to consider the truth or falsity of values P(x) of a predicate, 
before the objects x themselves are clearly distinguished. 

In our theory with a presupposed domain D, whether objects are equal 
or not is dependent on the picture of the world out of which we have 
constructed the D. 

Is the morning star x equal to (the same as) the evening star y? (In 
symbols, is x = y?) It is when the domain D** is that of astronomical 
bodies, wherein both the morning star and the evening star are Venus 
(seen at different times). To shepherds, unversed in astronomy, the star 
observed in the morning, and the star observed in the evening (on other 
dates) are two different natural phenomena; the domain D* is different. 
To a child, the evening star seen on one date and the evening star on 
another date may be different phenomena is still another domain D. We 
can think of D* as equivalence classes in D, and of D** as equivalence 
classes in D* (or larger ones in D), where in each case the abstraction 
results from recognizing different members of one set as related by an 
equivalence relation or thinking of them as different manifestations of the 
same “underlying reality”. 
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The forward implication in Leibniz’ definition of equality, namely 
that x = u implies 'iP(P(x) ~ P{y)), expresses in a little different way 
what we called the “replacement property of equality” (Corollary 2 
Theorem 30). Our principle is stated with respect to the class of formulas 
under consideration, and includes the case of replacement inside quanti- 
fiers. The same property is also sometimes described as “extensionality”, 
from the standpoint of the contexts in which x = y justifies the replace- 
ment of x by y. A context in which such replacement is justified is called 
extensional ; one in which it is not, nonextensional or sometimes intensional. 
This jibes with our previous use of “extensional” vs. “intensional”, where 
replacement under was involved. 1 ' 2 

Consider the following, “Let n be the number of planets in the solar 
system. Kepler did not know that n > 6. But n — 9 (according to current 
science). Therefore Kepler did not know that 9 > 6.” 112 The context 
“Kepler did not know that — > 6” is intensional, since the meaning of 
what is put in place of the blank, rather than just the value of it as a 
member of the domain D, affects the truth of the result. So the replacement 
of “/?” by “9” on the basis of the equality “« = 9” is not justified in 
“Kepler did not know that n > 6”. 

By this example, we can add the logic of knowing, believing, etc. to 
the examples already given end § 12 of nonextensional contexts for 
replacement under material equivalence, i.e. where equality of truth values 
is insufficient to justify replacement. For, from “/? = 9” we get 
"n > 6 ~ 9 > 6” by a correct application of replacement (Theorem 
30(11)); but “/? > 6” cannot be replaced by “9 > 6” in “Kepler did not 
know that n > 6”. 

When a context is not extensional for replacement under equality, 
something is being taken into account in it that is not recognized in 
constituting the elements of the domain D. By splitting up those elements 
to construct a new domain D\ the equality in D may become an equiva- 
lence relationship on D' while extensionality is restored. Thus if we 
construe D‘ as “mental” objects for Kepler rather than “real” objects, 
so that n and 9 are different mental objects, then “n = 9” is not true. So 
the occurrence of nonextensional contexts for equality can be regarded as 
just another way of looking at the difference between an equality and an 
equivalence relationship. Admittedly, in matters of knowledge or belief, 
it may be rather hard to describe exactly a domain D' which suffices to 
restore extensionality. 

Since a shift in our understanding of what the domain D of objects is 

112 Kepler, famous for his discovery of the laws of planetary motion, died in 1630. 
The planets Uranus, Neptune and Pluto were discovered in 1791, 1846 and 1930, 
respectively. 
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can make a difference between equality and equivalence, it is not surprising 
that the terminology is often confused. Here is a table of translations. 

x=y x is the same (object) as y. 

x is identical with y. 
x equals y [usually], 
x is y [sometimes]. 

The first two readings are the least ambiguous. But we normally use 
the reading “equals” for because it is briefer than the first two, and 
indeed is the standard way of reading “=” in mathematics. But mathe- 
maticians often use “=” and “equals” for what we consider an equivalence 
relationship. As a new example, textbooks in plane Euclidean geometry 
often write “ AB = CD" to mean that the line segments AB and CD are 
of the same length. One can’t then replace “ AB ” by "CD" in “AB X EF " 
(read “AB is perpendicular to EF"). So the translator into our logical 
symbolism must be on guard to recognize whether “equals” is being used 
for our = , or for an equivalence relationship. Of course, “length 
AB — length CD” is an unambiguous translation of "AB = CD" in the 
present example, and this could be written "AB ~ CD". Besides equality 
or identity in the fundamental sense, mathematicians need many (other) 
equivalence notions. So terms like “equivalent”, “congruent”, “similar”, 
“homologous” (and as we have seen, sometimes “equal”) are often 
employed for such equivalence notions. 

We come finally to “is” or other forms of the verb “to be”. In addition 
to its use as an auxiliary verb (“he is going”, “this was done”, “John is 
loved by Jane”) and in expressing the existential quantifier (“there is”), 
we need to notice three others. (1) When A and B are objects both con- 
sidered as members of the same domain D, "A is B ” (almost always) means 
"A — B" in our sense, i.e. that A and B are the same member of D. 
Examples: “The number of planets is 9”, “Two plus two is four”. (2) When 
A is an object considered as a member of a domain D, and B is a subset of 
D, "A is B ” means that A is a member of B, and is translated by “A e B" 
or B(A). Examples: “Socrates is a man”, “Snow is white”, “Jane is lovely”. 
(3) When A and B are both subsets of D, "A is B" means that the subset 
A is included in (is a subset of) B, and is translated by "A £ B" or 
Vx(A(x) D B(x)). Examples: “Men are mortal”, “Cats are animals”. 

So as not to give away the translation in advance, we have used capital 
"A” and "B” in all three uses. With symbolism chosen as we have usually 
chosen it, (l)-(3) would be distinguishable by lower case (small) letters for 
members of sets and capital letters for sets, thus: (1) “a is b” translated 
“ a = b ”, (2) “ a is B" translated "a e B" or B(a). (3) “A is B" translated 
“A s B ” or Vx(A(x) D B(x)). (In translation from verbal language the 
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problem is then, “Which terms correspond to small letters, and which to 
capitals?”) 

This usage cannot be preserved if we consider sets whose members are 
also sets whose members we may want to name. “Men are numerous” 
should be translated “ M e N" where M is the set of men and N is the set of 
numerous sets. 

It should hardly need to be pointed out that confusion of these meanings 
of “is” can lead to fallacies. 

Under Meanings (1) and (3), “is” is transitive (a = b & b = c — ► a = c, 
and (Barbara, Exampje 23 §27)); but not 

under Meaning (2) (fallacious example: “Socrates is a man. Men are 
numerous. Therefore Socrates is numerous.”). 113 Under (1), “is” is both 
reflexive and symmetric; under (2), “is” is not symmetric, nor (as set 
theory is usually understood) reflexive; under (3), “is” is reflexive, but 
not symmetric. 

Exercises. 30. 1 . (a) Show that ( p , q) ~ (r, s) = ps — qr is an equiva- 
lence relation between pairs of integers (p,q) with q ^ 0. (b) Show that 
(p, q) + (r, s) — (ps + qr, qs) is an operation depending only on the 
equivalence classes (cf. the text), (c) Treat similarly (p, q) ■ ( r , s) = (pr, qs ). 

(d) Treat similarly (p, q) < (r, s) as defined in the text. 

30.2. Let D = {0, 1,2, 3, 4, 5, 6, 7} and x~y = (x—y or y—x is an 
even natural number). Show that ~ is an equivalence relation on D, and list 
the equivalence classes. 

30.3. Translate into logical symbolism, and establish validity or 
invalidity. 

(a) Everybody loves Jane. Jane loves someone. Therefore there are two 
people who love each other. 

(b) Everybody loves Jane. Jane loves someone besides herself. Therefore 
there arc two people who love each other. 

(c) 7t is the ratio of the circumference of a circle to its diameter, n is 
between 3.1415 and 3.1416. Therefore the ratio of the circumference of 
a circle to its diameter is between 3.1415 and 3.1416. 

(d) Samuel Clemens wrote “Huckleberry Finn”. Mark Twain wrote 
“Huckleberry Finn”. “Huckleberry Finn” is the work of one author. 
Therefore Mark Twain is Samuel Clemens. * 

(e) Jane has at most one husband. Jane is married to Thomas. Thomas is 
thin. William is not thin. Therefore Jane is not married to William. 

(f) Tom is a brother of Dick. Dick is a brother of Harry. No man is his 
own brother. Therefore Tom and Harry are different people. 

(g) * Every person in this room has a brother and a sister both in this 
room. No person is his own brother or sister. No brother of anyone is a 


113 Example from Suppes 1957 p. 183. 
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sister of anyone. Therefore there are either no people or at least four 
people in this room. 

(h) The moon seen today is round. The moon seen twelve days ago was 
crescent shaped. The moon today and the moon twelve days ago are the 
same object. Therefore something is both round and crescent shaped. 

(i) A rose is red. Red is a color. Therefore a rose is a color. 

(j) A gnu is an antelope. An antelope is a mammal. Therefore a gnu is a 
mammal. 

(k) Salt and sugar are white. Nothing is both salt and sugar. Therefore 
nothing is white. 


*§31. Descriptions. To complete our survey of first-order logic 
(cf. § 17), wc shall discuss briefly the definite article “the” and the 
indefinite article “a” or “an”. 

We can name objects by descriptive phrases containing the definite 
article “the”, such as the following: (a) “the present queen of England”, 
(b) “the present king of France”, (c) “the sister of x", (d) “the father of 
(e) “the number which added to x gives y", (f) “the number whose 
square is x", (g) “the least common divisor of x and ?/”, (h) “the third 
smallest prime number”, (i) “the greatest prime number”, (j) “the number 
iv such that, for all x, x + w = a;”. In our model theory, by “objects” we 
mean members of the domain or universe of discourse D about which we 
are thinking at the moment. 

For some of the objects just named, we have other names not using 
“the”: (a,) “Elizabeth II” (1966), (ej y - x, (h t ) 5, (j,) 0. The usefulness 
of descriptions as a linguistic device is that it gives us a way to make a 
name, usually for temporary use, when we do not already have one but 
have the vocabulary to describe or characterize the object. 

As our language is commonly understood, a definite description is not 
used “properly” unless it really does describe a unique object, or, il free 
variables are present, a unique object for each choice of values of those 
variables. The general form of a definite description is “the >v such that 
F(w)’\ If F(\\ j is a predicate of just the one variable tv, the description is 
proper exactly if there is just one object tv in the domain D such that 
F(uj; this condition translates into our logical symbolism as 3!wF(w) 
(cf. § 29 ). If F(w) is a predicate of other variables x lt . . . ,x„ as well, write 
it then also “F(x„ . . . , x n , iv)”, then (as long as we are not confining our 
attention to only certain values of x u . . . ,x n ) the description is proper 
exactly if, for each x lt . . . , x n in D, there is a unique tv in D such that 
F(x j, . . . , x n , w); in our symbols, Vx! . . . Vx^lwFfXi, . . . , x„, w). In 
this case, the description serves to name a function f(x lt . . . , x n ) (as well 
as its values). 105 We include the previous case under this, by allowing n = 0 
so an individual (or object) is named. 
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Since 1848, (b) has been an improper description. 114 When D is all 
persons, (c) is improper, since not every person x has just one sister; 
only for smaller D's does (c) define a function f{x). But (d) is proper, 
though proverbially only a wise x knows the value of /(#). When D is 
the positive real numbers, (e) is improper (e.g. no positive real number can 
be added to 5 to give 3), but (f) is proper. When D is all the real numbers, 
(e) is proper, but (f) is not (e.g. 4 has two square roots, 2 and —2). By a 
theorem of Euclid, (i) is improper. 

In ordinary discourse, improper descriptions hardly occur. If someone 
speaks about “the w such that F(W)" when there isn’t a unique such w>, 
we usually conclude that he is either confused himself or is attempting to 
mislead us. We might try to say that any sentence A containing an 
improper description is simply false. But then we run into the difficulty 
that ~iA and A 3 B would also be false by this criterion, whereas (by our 
truth tables for ~ i and 3) -iA and A 3 B must be true when A is false. 
Whitehead and Russell 1910 pp. 69-75 (66-71 in the 1925 ed.; also it’s in 
van Heijenoort 1967) handled this difficulty by requiring the part of a 
sentence which is declared false because it contains an improper description 
to be indicated; the truth or falsity of larger parts containing it is then to 
be determined by the usual rules. There is some awkwardness in this; 
and the problem of the best way to treat descriptions allowing improper 
ones is still a subject of research. 119 Improper descriptions, and partially 
defined functions (and in other connections, multiple-valued functions) 
have some uses in mathematics. However, for the brief treatment we give 
here, we shall follow common usage in avoiding improper descriptions. 

But then we have another difficulty, that what a sentence is (or in our 
treatment of logic, what a formula is) will not always be determinable 
immediately from the way it is assembled out of its parts (as in our definitions 
of “formula” in §§ 1, 16, 28, 29), but will depend on validity or provability 
(or valid consequence or deducibility) results. That is, if A contains a part 
“the w such that F(w)”, and would be a formula under the definition in 
§ 28 or § 29 if this part were allowed as a term, we shall not know whether 
to call A a formula until we know whether we have 3!wF(w). 

As the price of abjuring improper descriptions, we shall have to live 
with this difficulty. 118 It will not be very hard to do so, since in ordinary 

114 Or even since 1830. Louis Philippe’s title was “King of the French”, not “King of 
France”. 

115 See Scott 1967, where other references are given. 

n * This difficulty is slightly disguised in the present treatment [Theorem 32], since for 
the predicate calculus with functions and equality we have not yet said anything about 
limiting the supply of mesons available. But if F be closed formulas which express the 
axioms of some theory for which only the ions and mesons of F are considered available, 
then the introduction of f entails an enlargement of the class of formulas which we make 
only after obtaining Vx t . . . Vx„3!wF(x,, . . . , x„, w). 



DESCRIPTIONS 


169 


§31 


discourse and in mathematics we normally want to use definite descrip- 
tions only after we already have 3!wF(w) or 
Vxj . . . Vx„3!wF(x 1 , .... x,„ w). 

We might introduce descriptions into our symbolism by adding an 
operator *‘iw” (read “the w such that”). This would bind variable occur- 
rences in terms iwF(x l5 . . . , x„, w) (and in terms and formulas containing 
such terms), just as Vx and 3x do in formulas. 

In our treatment here we shall stay within the notation of the predicate 
calculus with functions and equality as treated above. The descriptions 
we want can then be handled simply as introductions of new function 
symbols f. 117 Thus, when we have 

(i) Vxj . . . Vx„3!wF(Xj, . . . , x„, w), 

we may introduce a new function symbol f with the formula 

(ii) Vxj . . . Vx„F(x„ . . . , x„, f(Xi, .... x„)). 

We may even read f(x x , . . . , x K ) as “the w such that F(Xj, . . . , x,„ w)” if 
we wish; but its properties will be given by (ii) with (i). We shall make 
this explicit in the theorem below. Any finite succession of descriptions, 
each after the first possibly involving the preceding ones, can thus be 
provided, without piling up complicated patterns of bound variables in 
the terms. This is the usual practice in mathematics, except in simple 
passages. 

In this treatment, we lose the advantage that descriptive definitions are 
self-explanatory, since each time a new function symbol f has been 
introduced, we have to remember the formula (ii) with which it was 
introduced. If the function expressed by the symbol is to be used often, 
we will quickly become used to the symbol f for it, and the notation 
f(x 1( . . . , x„) will be more convenient than the more cumbersome 
twF(x t , .... x,„ w). It is in speech and in short written passages that 
the descriptive operator “the w such that” or “iw” has a clear advantage 
over introducing new function symbols. However, for our logical analyses 
of arguments encountered in ordinary language, it is a simple matter to 
supply (or imagine supplied) function symbols corresponding to the 
descriptive names used in the language. 

117 We say “function symbol f ” rather than “meson f( — , .... — )” since we really 
have in mind a meson consisting of a single new symbol (a “function symbol f”) with 
argument places following if n > 0. However, the following treatment can also be read 
with “meson f(— , .... — )” in place of “function symbol f ”, so that other convenient 
notations could be used. In fact, the meson f( — , . . . , — ) for a given description could 
even be iwF( — , w), provided terms arising from different such mesons 
successively introduced cannot be confused (and the bound variables w are chosen so 
that the substitutions performed for x,, . . . , x„ in twF(x,, . . . , x„, w) are free; cf. IM 
bottom p. 154). 
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So we are to get the effect of proper (definite) descriptions by a theory 
for adding new function symbols when it has become known (absolutely 
or under the assumptions we have made) that the functions are definable 
descriptively. 

We shall give the treatment only in model theory, where it is almost 
trivial. A proof-theoretic discussion (after Hilbert-Bernays 1934) would 
be a little too detailed to fit well here. 118 

Theorem 32.' Let T be any list of zero or more formulas in the predicate 
calculus with functions and equality, F(Xj, . . . , x„,"w) be a formula con- 
taining free only the distinct variables x x , ... , x„, w (n > 0), f be a function 
symbol not occurring in any of the formulas T and F(x 1( . . . , x„, w), and C 
be a formula not containing f. Then : IfV 1= Vx x . . , Vx„3 !wF(Xj, . . . , x„, w) 
and r, Vxj . . . Vx„F(x lt .... x„, f(x„ . . . , x„)) k C, then V k C. 

Proof. Consider any given domain D, and any given assignment in 
D to just the free variables, ions and mesons in T, 

Vxj . . . Vx, i 3!wF(Xj, . . . , x„, w), C which makes all of T t. We must show 
that C is also t. By the hypothesis that T k Vx x . . . Vx B 3!wF(x 1( . . . , x,„ w), 
the formula Vxj . . . Vx,,3!wF(x„ . . . , x,„ w) is t. By Example 3 § 29 
and the evaluation rule for V, hence, for each choice of members 
x t , . . . , x„ of D as values of x„ . . . , x„, there is a unique member vv of D 
such that F(x 1( . . . , x„, w) is t when tt> is the value of w. So for this w 
and ar„ . . . , x„, we can write tr = f(x x , . . . , x n ) where / is an «-place 
function with arguments and values in D. But now, if we augment the 
given assignment by taking /(x „ . . . , x„) as value of f(x 1( .... x„), then 
Vx, . . . Vx„F(x„ . . . , x,„ f(x„ . . . , x,,)) is t. So by the hypothesis that 
F, Vx x . . . Vx,,F(x„ . . . , x,„ f(x„ . . . , x„)) k C, the formula C is also i. 
This is under the augmented assignment; but since f does not occur in C, 
the formula C is t under the given assignment, as was to be proved. — 

The effect of this theorem is that, if Vx x . . . Vx„3!wF(x 1 , . . . , x„, w) is 
a valid consequence of our assumptions F, then to the symbols we are 
using we can add f(Xj, . . . , x„) expressing “the w such that F^, . . . x„, w) 
is t when u is the value of w” and to our assumptions the formula 
Vxj . . . Vx„F(x 1 , . . . , x re , f(x x , . . . , x„»; any formula C not containing 
f which is a valid consequence of the augmented assumptions will be a valid 
consequence of the original assumptions. In the process of recognizing C 
to be a valid consequence, our reasoning may be helped by the additional 
assumption Vx x . . . Vx„F(x x , . . . , x,„ f(x 1; . . . , x J). In fact, there can 
be a very great gain in efficiency here, through the use of the function 

118 Cf. IM §74. From our model-theoretic treatment and the Godel completeness 
theorem (§ 52 below), some of the proof-theoretic results will follow (but this will not 
give them in the elementary way which is desired in proof theory; cf. § 37). 
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symbol f(x„ . . . , x„) in conjunction with the replacement theorem 
(Theorem 30 § 29). 

Example 6. Let the formulas be constructed as in § 29 with 1, 

(or also — -1 ) as the mesons. In the predicate calculus with equality, 

(1) VxVyVz (xy)z=x(yz), Vx xl =x, Vx xx -I = 1 hxz=yzDx=y 

(Exercise 31.2). Hence, by Theorem 32 (with Theorems 9 1>d== and 12,. d _), 


(2) VxVyVz (xy)z=x(yz), Vx xl =x, Vx3!wxw=l k xz=yzDx=y. 

Example 7. In Example 6, (2) does not hold omitting Vx3!w xw=l 
(call it then (2')). For, the two assumption formulas of (2') are both 
t and the conclusion is f, when D is the rational numbers and 1, xy, 
0, 1, 0 are the values of 1, xy, x, y, z. (In effect, xz=yzDx=y lets us 
prove 0=1 from 0 0= 1-0 by “dividing by 0”.) 

Corollary. If F k Vx, . . . Vx„3!wF(x,, . . . , x„, w), and V is consistent 
in the sense that for no formula A do both F k A and F k iA hold, then 
so is T, Vx, . . . Vx B F(x„ . . . , x„, f(x„ .... xj). 

Now we observe that Theorem 32 and its Corollary hold even with 
“3w” in place of “3!w” (call the theorem then Theorem 32a). The only 
difference in the proof is that, for each x v . . . , x n as values of x„ . . . , x„, 
the h> is not completely determined, but a particular one must be chosen 
from the class of w’s such that F(x,, . . . , x„, w) is t when vr is the value 
of w. 119 

Theorem 32a justifies the use of “proper indefinite descriptions”, i.e. 
expressions of the form “a w such that F(x,, . . . , x,„ w)” where 
Vx, . . . Vx,,3wF(x„ . . . , x„, w) is available (n > 0). 

Extreme care should be used, however, in adopting f(x„ . . . , x„) as a 
translation of “a ip such that F(x„ .... x,„ n>)” in any extended argument. 
For, with each set of values of x„ . . . , x,„ the term f(x„ .... x„) in all 
its uses under the assumption (ii) has to express the same one of the ip’s 
such that F(x lr . . . , x n , ip). (This is reflected in the equality axioms for f, 
§ 29.) In ordinary discourse, different uses of “a w such that 
F(x u . . . , x n , vp)” in an extended context are likely to be independent of 
one another. In fact, uses of “a” or “an” to describe a member of a set 
ipF(x,, . . . , x n , ip) are usually so transitory that an adequate translation 


n “ The principle that, when Vx, . . . Vx„3wF(x„ . . . , x„, w) is t, we can, for each 
x „ in D, choose such a ip in D as the value of a fixed function /(x„ . . . , x„), 
is a form of the axiom of choice (§ 35 below). 

For n = 0, Theorem 32a is hardly different from 3-elimination (with k for t, and with 
F(w) in place of A(x)): the individual symbol f in (ii) takes the place of the variable w 
held constant in “J\ F(w) k C”. 
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can be given just using a variable as the name of the object described. 
Examples: “Socrates is a man” (3x[s=x & H(x)], or simply H(s) as in 
Example 19 § 26J, “A child needs affection” (Vx[C(x) D A(x)] as in § 27), 
“A man was here” (3x[M(x) & H(x)]). 

Exhiu isis. 31.1. Prove Corollary Theorem 32. 

31.2*. Establish (1) in Example 6. !2U 

31.3. Show that 

Vx3 !wF(x, w), VxF(x, f(x)) f F(x, w) ~ f(x) = w. 

Thus, in operating with a proper description, F(x, w) and f(x)=w are 
equivalent. 

31.4. Criticize the following arguments. 

(a) Henry and Jane are brother and sister (F(h, j)). Therefore Jane is the 
sister of Henry (j = f(h); cf. Exercise 31.3). Flenry and Edith are brother 
and sister (F(h, e)). Therefore Edith is the sister of Henry (e = f(h)). 
Therefore (by (e) and (f) § 29) Edith and Jane are the same person (e = j). 

(b) By the meaning of “the sister of John”, John and the sister of John 
are brother and sister (F(j, f(j)). Therefore (by 3-introd.), John has a sister 
(3wF(j, w)). 

liu In substance the solution is the proof of Til in § 39. (A direct proof of (2) with 
“f” is only slightly longer. In more complicated examples the gain by using Theorem 
32 or a proof-theoretic version of it can be greater.) 
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§ 32. Countable sets. In the present century, work on mathematical 
logic and work on the foundations of mathematics have been closely 
connected. Problems and ideas about the foundations of mathematics 
have contributed much to the development of logic, and logic has been a 
primary tool in the investigation of those foundations. In this Part II of 
the book, we shall survey this common area. We shall both acquaint 
ourselves with new developments, and examine more carefully some 
notions which underlay the discussion in Part I. 

We begin with some points in Cantor’s theory of sets, which dates from 
the discoveries he published in 1874 concerning the comparison of 
infinite collections. 

Suppose we wish to know whether one collection is less numerous or 
equally numerous or more numerous than another. For finite collections, 
we can settle this by attempting to “match” or “pair” the members of the 
sets, or as we shall say hereafter to put them into one-to-one (1-1) corre- 
spondence. If the two sets can be put into 1-1 correspondence, they are 
“equally numerous” or as we shall say have the same cardinal number. 
However, the idea of such a correspondence is more primitive than the 
idea of “cardinal number”, as the following example illustrates. 

In a tribe of aborigines who cannot count beyond twenty, a chief is to 
be chosen from two candidates A and B by awarding the position to the 
candidate with the larger herd of cattle. The two herds are run through a 
gate, with a pair of animals, one from each herd, always passing through 
together, until one or the other herd or both are exhausted. If A’s herd is 
exhausted before B’s, B becomes chief; and vice versa, A wins if he has 
animals left when all of B’s have gone through. If the last two animals 
walk through together, a different method of selection must be used, or a 
co-chieftancy established. Though each herd may have more than twenty 
cattle, so it could not be counted by the tribe, this method of pairing 
works. 

In 1638, Gallileo noted the “paradox” that the squares of the positive 
integers can be placed in 1-1 correspondence with all the positive integers, 
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contrary to the axiom of Euclid that the whole is greater than any of its 
proper parts, i.e. parts not the whole. 121 

Thus with infinite collections, putting one collection into 1-1 corre- 
spondence with a proper part of the other does not exclude the possibility 
that under a different method of pairing the wholes may correspond 1-1. 
With the two herds, this could not happen: if B wins the chieftainship on 
one way of running the herds through the gate, A is certain (though he 
has not seen a mathematical proof) that he could not have tied or won by 
sending his animals through in a different order. 

We assume familiarity with the sequence of the natural numbers (or 
nonnegative integers ) 122 

0, 1, 2, 3, 4, 5 

Collections which can be placed in 1-1 correspondence with the natural 
numbers we call countably infinite or denumerably infinite or enumerably 
infinite. To show that an infinite set is countable, we need only put its 
members into an infinite list, or 1-1 correspondence to the natural numbers 
as written above. A particular such list, or 1-1 correspondence to the 
natural numbers, we call an enumeration of the set. Examples of countably 
infinite sets are, besides the natural numbers themselves, the set of the 
positive integers, the set of the squares of the positive integers, and the 
set of all the integers, as we see from the following enumerations: 

1, 2, 3, 4, 5, 6, 

1, 4, 9, 16, 25, 36, 

0, 1, -1, 2, -2, 3, 

A countable set or denumerable set or enumerable set is a set which is 
either countably infinite or finite. Here we can describe a finite set as a 
set which can be put into 1-1 correspondence with an initial segment 

0 n — 1 of the natural number sequence, possibly the empty 

segment (/i = 0). This is equivalent to saying that a finite set is a set that 

121 That an infinite collection can be put into I - 1 correspondence with a proper part of 
itself was noticed a number of times before Gallileo. Steele (in his historical introduction 
to his 1950 translation of Bolzano 1851, who noted the same) cites Plutarch (467-125? 
A.D.) and Proclus (412-485 A.D.). Thomas 1958 cites Adam of Balsham (Parvipon- 
tanus) 1 132, and Pierre Duhem “Le systeme du monde” 1954 ed. vol. 7 p. 123 cites 
Robert Holkot (d. 1349). Julius R. Weinberg supplied these references and also 
“Centriloquium Theologicum”, Conclusio 17 (erroneously attributed to William 
Ockham (d. 1349?)). 

122 Some authors use “natural numbers” as a synonym for “positive integers” 
1 , 2, 3, . . ., obliging one to use the more cumbersome name “nonnegatise integers” 

for 0, 1,2 Besides, in this age, we should accept 0 on the same fooling with 

1, 2, 3 
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has a natural number n as its “cardinal number”, as this use of natural 
numbers is commonly understood. 123 

Another example of a countably infinite set is the set of the rational 
numbers. This is surprising, if one first compares them with the integers in 
the usual algebraic order. The points on the z-axis with integral abscissas 
are isolated, while those with rational abscissas are “everywhere dense”, 
i.e. between each two, no matter how close, there are always others. We 
can nevertheless enumerate them by the following device. We begin with 
the fact that each rational can be written as a fraction of integers with a 
positive denominator. We arrange all such fractions in an infinite matrix, 
thus: 

° 1 -1 2 -2 3 

T T "*■ ~ T T 1 1 

1 / / / / 

° 1 -I 2 -2 3 

2 2 ~2 2 2 2 

/ / / 

0 1 -1 2 -2 3 

3 3 3 3 3 3 

1 / s 

0 1 - 12-23 

4 4 4 4 4 4 


We can enumerate these fractions by following the arrows. Finally, we 
can go through that enumeration striking out each fraction which, 
interpreted as a rational number, is equal in value to one that has preceded 
it. This gives us the following enumeration of the rational numbers: 

0, 1, — 1, 2 , 2> 2, 2, j, J, .... 

Another enumerably infinite set is the set of the real algebraic numbers, 
that is, the set of real numbers which are roots of algebraic equations 
(polynomial equations) in one variable x with integral coefficients, such as 
the equation 

4z 5 - 17a; 3 + 2a; 2 + 5 = 0. 

The general form of an algebraic equation of degree n (> 1) is 

a 0 x n + a^”- 1 + . . . + a n _^x + a n = 0 (a Q ^ 0). 

This is surely the more intuitive definition of finiteness of a set. But the property 
of the set of the positive integers noted in Gailileo’s “paradox” is characteristic of 
infinite sets. So, as Peirce 1885 and Dedekind 1888 proposed, a finite set can be defined 
alternatively as a set which cannot be put into 1-1 correspondence with a proper part 
of itself. Cf. 1M pp. 13, 14 (after reading § 34 below). 
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If we can enumerate the algebraic equations, we can enumerate the real 
algebraic numbers. For then, in the enumeration of the equations, we 
can replace each equation by its distinct real roots, which will be finitely 
many (at most as many as its degree), to obtain an “enumeration with 
repetitions’’ of the real algebraic numbers. Then the repetitions can be 
removed. The algebraic equations with integral coefficients can be 
enumerated by first noticing that we can without ambiguity simply write 
the exponents on the line with the rest of the symbols (thus: 

4*5 — 17x3+ 2*2+ 5=0). Then the equations become finite sequences of 
just fourteen different symbols: 

0123456789*+- = 

The first symbol in an equation is not a 0. Now we can regard the fourteen 
symbols as the digits in a quattuordecimal number system, i.e. a number 
system based on 14 in the same way that the decimal system is based on 
10. When we do so, each equation becomes an expression for a natural 
number, indeed a positive integer, in that system. Of course, not all natural 
numbers when written in the 14-system with the above symbols as the 
digits will read as algebraic equations. By leaving out-the natural numbers 
which do not (closing up the “gaps”), we get an enumeration of the alge- 
braic equations; that is, the algebraic equations can be listed in the order 
of magnitude of the positive integers which they become on interpreting 
their symbols as the digits in a 14-system. 

We call the method just used for enumerating the algebraic equations 
the method of digits. We use it now to establish the following general 
principle. 

(A) If all the members of a set S can be named unambiguously by non- 
empty finite sequences of ( occurrences of) symbols from a given finite list 
of symbols or alphabet s 0 , . . . , s J ,_ 1 (or even from a countably infinite 
alphabet s 0 , s 1( s 2 , . . .), then the set S is countable. 

In the case of a finite alphabet s 0 , . . . , s fl _i, we could proceed as above 
(where p — 14 and s 0 , . . . , s,,., are the fourteen symbols displayed), 
except for one contingency. After regarding a finite sequence of (occur- 
rences of) s„, . . . , Sj, ! as an expression for a natural number in the 

/^-system with s„, . . . , s^ as the digits, we cannot tell from that number 
as a number how many initial s„’s there were in the sequence. Thus, in 
the 14-system with the above digits, the same number is expressed by each 
of: 4*5—17*3+2*2+5=0, 04*5 — 17*3+2*2+5=0, 

004*5—17*3 + 2*2+5=0,.... This ambiguity did not matter above, 
since we could and did exclude algebraic equations beginning with a 0. 
To avoid the ambiguity about the number of initial s 0 ’s when it does 
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matter, we can instead regard the symbols s 0 , . . . , s ( ,_ x as the digits for 
1, ...,/? in a p+ 1-system having a different symbol for 0. 

Notice that it does not matter whether the members of S have only 
one name each or several names using the alphabet s 0 , . . . , s^. If 
members of S may have several names, then when we eliminate the 
numbers not arising from names we can also eliminate all but the smallest 
number which we get from the several names of any one member of S. 

The case of a countably infinite alphabet s 0 , s t , s 2 , . . . can be reduced to 
the case of a finite alphabet by replacing each of the infinitely many 
symbols by a suitable combination of finitely many symbols, For example, 
we can pick two symbols a and b, and then replace s 0 , s^ s 2 , s 3 , . . . by 
a, ab, abb, abbb, .... Thereby e.g. the name s^s^ becomes aabbbabab, 
from which without ambiguity we can recover s^s^. The method of 
digits can be applied to the new two-letter alphabet a, b; thus we can 
interpret aabbbabab as expressing a number in the ternary system with 
0, a, b as the digits for 0, 1, 2. In particular cases, some other reduction 
of s 0 , s l5 s* ... to a finite alphabet may be more convenient to use. 

Conversely: (B) If a set S is countable, its members can be named 
unambiguously by nonempty finite sequences of ( occurrences of) symbols 
from a given finite alphabet. For, if S is infinite and a 0 , a u a 2 , . . . is a 
particular enumeration of S, then a 0 can be named by 0, a x by 1, o 2 by 
2, , using the 10 symbols 0, 1, .... 9. Briefly, each member a t of S 

can be named by (the numeral for) its index i in a given enumeration 
a 0 , a i< a z> • • • of S. Similarly, if 5 is finite. 

Together, (A) and (B) should make it virtually obvious what sets are 
countable (i.e. finite or countably infinite). Other devices than the method 
of digits are often convenient for enumerating sets. 

Exercises. 32.1. Show that, in the method of digits for a finite alphabet 
s 0 , ... , the names of the members of S' are simply enumerated by 
taking all the one-letter names first, then all the two-letter names, then all 
the three-letter names, .... using alphabetic order within each group. 

32.2. Use the method of digits to show the rational numbers countable. 
Give the first six rational numbers in the enumeration you obtain. 

32.3. Show that the following sets are countable (use both the idea 
employed in the text for the rationals, and the method of digits): 124 

(a) The ordered triples (a, b, c) of natural numbers, or of the members of 
any given countably infinite set, 

(b) The finite sequences of such members. 

(c) The finite sets of such members. 

(d) The finite sequences of finite sequences of such members. 

144 (I, 2, 3), (2, I, 3), (2,1, 1,3) are different finite sequences; but the sets with members 
shown in curly brackets {I, 2, 3), (2, 1,3}, {2, 1, I, 3} are the same. 
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32.4. Criticize the following argument: “Every real number x can be 
named unambiguously by an integer plus an infinite decimal fraction 
X + .x l x 2 x a . . . (e.g. v: = 3 + .14159 . . . , — f = — 1 + .24999 . . .). Only 
finitely many symbols 0, . . . , 9, . are used in the names. There- 
fore by the method of digits (or (A)), the set of the real numbers is 
countable.” 

§ 33. Cantor’s diagonal method. The results in § 32 on the appli- 
cation of the notion of 1-1 correspondence to infinite sets would perhaps 
have found their place in mathematical history as an interesting curiosity, 
not noticed before (or noticed earlier and forgotten) but leading nowhere 
especially, if it had turned out that each two infinite sets can be put into 
1-1 correspondence. The fruitfulness of the idea of comparing infinite sets 
under 1-1 correspondences appears in that this is not the case. For, as we 
show next, there are uncountable or nondenumerable sets, i.e. infinite sets 
which cannot be put into 1-1 correspondence with the natural numbers. 

We begin with the set of the one-place number-theoretic functions. 
These are the functions of one variable a ranging over the natural numbers 
with values also natural numbers. Examples are a 2 , 3«+l, 5 (a constant 
function), a (the identity function), [V a] (the greatest integer < V a), 
etc. 125 

To prove the uncountability of the set of (all) these functions, suppose 
given an enumeration f 0 (a), fi(a), / 2 (a), ... of some one-place number- 
theoretic functions (not necessarily all). We shall thence construct a 
one-place number-theoretic function f(a) different from each function in 
the given enumeration. This will prove that the given enumeration cannot 
be an enumeration of all the one-place number-theoretic functions. To 
help us in visualizing the construction of /(a), let us tabulate the sequences 
of values of the functions f 0 (a), ffa), /.(a), ... as the rows of an infinite 
matrix: 



a 

0 

i 

2 


m 

MO) 

foil) 

/o(2) 




\ 


(1) 

m 

MO) 

/i(!) 

\, 

/i(2) 


fM) 

MO) 

/ed) 

M2) 


125 We have formulas (i.e. finite names) for these live functions. But also functions 
are to be included whose successive values are determined by laws not corresponding to 
ordinarily used names, and even functions whose successive values are picked “irregu- 
larly” or in a “random” manner. 
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We define f(a) to be the function whose sequence of values is obtained by 
taking the successive values along the diagonal (indicated by the arrows) 
and changing each of them, say by adding 1 to it; thus, 


f{a) —faj a ) + 1 . 

This function does not occur in the given enumeration f 0 (a), f x (a), 
f 2 (a), . . .. For, it differs from f,(a) in the value taken for 0, from f x {a) in 
the value taken for 1, and so on. 

(Thus, if f 0 (a) = a 2 ,f x (a) = 3<j+ l,/ 2 (a) = 5 ,/ 3 (a) = a,f t (a) = [Va] 

then / 0 (0) = 0, /,( 1 ) = 4, /,( 2) = 5, f 3 ( 3) = 3, f 4 ( 4) = 2, . . . 

but /( 0) = 1 , /( 1 ) = 5, /( 2) = 6, /( 3) = 4, /( 4) = 3, . . ..) 

To phrase the argument differently, suppose that the function f(a) 

were in the enumeration f 0 (a), f x (a), f t (a), . . . ; i.e. suppose that for 
some natural number p, 

f(a) =/» 

for every natural number a. Substituting the number p for the variable a 
in this and the preceding equation, 


f(p) =Up) =Up) + i. 

This is impossible, since the natural number f,(p ) does not equal itself 
increased by I. 

The method we have just used is called Cantor's diagonal method. 
By restricting the set of the functions to which we apply it, we next obtain 
some other examples of uncountable sets. 

We can take just those one-place number-theoretic functions, each of 
which has only 0, 1,2, 3, 4, 5, 6, 7, 8, 9 as values and which does not have 
all its values 0 from some value on. Then the rows of the matrix (1) can be 
interpreted as the nonterminating decimal fractions for real numbers x 
in the interval 0 < x <, 1 . Each such real number has just one such 
decimal expansion; e.g. J = .74999 . . . (we don’t use .75000 . . . = .75, 
which is “terminating”), I/V 2 = .2071 1 . . . , 1 = .99999 . . . , 

7t — 3 = .14159 . . . , 2/3 = .66666 . . .. The alteration performed along 
the diagonal must not take us out of the class of functions considered. We 
can, say, change each value 5 to 5 and 5 to 6; thus 


5 'f /«(«) 5* 5, 

6 if fja) = 5. 


(If = f, x x = l/v/2, # z =l, x 3 — TZ — 3, #4 = f, . . . , we obtain 
a real number x whose decimal expansion (necessarily nonterminating) 
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starts out .55565 . . ..) This application of the diagonal method estab- 
lishes that the set of the real numbers * in the interval 0 < x < I is 
uncountable. 

It follows almost at once that the set of all the real numbers is un- 
countable (Exercise 33.1 (a)). 

It is interesting historically to note how Cantor’s discoveries in 1874 
illuminated an earlier discovery of Liouville in 1844. Liouville constructed 
by a somewhat complicated special method certain transcendental (i.e. 
nonalgebraic) real numbers. Cantor’s diagonal method makes the 
existence of transcendental numbers apparent from only the very general 
considerations presented above. In fact, from any given enumeration of 
the algebraic numbers, particular transcendental can be obtained by the 
diagonal method. 

Finally, let us apply the diagonal method to the set of the one-place 
number-theoretic functions taking only 0 and 1 as values. Now we have no 
choice about the alteration performed along the diagonal. We must 
interchange 0 and 1 ; thus the "diagonal function” is defined by 



if /» = 0, 
if /,(«)= 1.* 


In this application, each function can be interpreted as describing the 
set of natural numbers at which the function value is 0; we call these 
functions the representing functions of these sets. We show some examples 
of sets of natural numbers at the left below, opposite the sequences of 
values of their representing functions. 



0 

1 

2 

3 

4 

.S',, = all numbers {0, 1, 2, 3, 4, . . .} 

0 

\ 

0 

0 

0 

0 

S 4 = the even numbers {0, 2, 4, . . .} 

0 

1 

* \ 

0 

1 

0 

S 2 = the squares {0, 1,4,.. .} 

0 

0 

I 

\ 

I 

0 

S 3 = the primes {2, 3, . . .} 

1 

1 

0 

0 

1 

\ 

S 4 — the empty set { } 

1 

1 

1 

1 

1 


For S 0 , S 2 , S 3 , S 4 , . . . as shown, the diagonal method gives the set 

^ = {1, 2, 4, . . .} whose representing function starts out with the values 
1, 0, 0, 1,0,.... Thus 0 is not a member of S, though it is of S 0 ; 1 is a 
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member of S, though it is not a member of Sp, etc. So S is not any of S 0 , 
S u . . . . This application of the diagonal method shows that the set of all 
the sets of natural numbers is uncountable (in contrast to Exercise 32.3 (c)). 

The sets shown in this section (including the exercise) to be uncountable 
can be put into 1-1 correspondence with one another (they are “equiva- 
lent”). 126 Closer examination of the diagonal method in § 34 will show 
that it gives us infinite sets which are neither countable nor “equivalent” 
to any of these uncountable sets. 

Exercise 33.1. Show the following sets to be uncountable. 

(a) The real numbers. 

(b) The transcendental numbers. 

(c) The one-place logical functions (§ 17) when D = {0, 1,2,.. .}. 

§ 34. Abstract sets. Starting from these discoveries, Cantor de- 
veloped a theory of abstract sets, in which he gave them a general setting 
and attempted to discuss sets of the most general sort. 127 We can give 
here only a brief indication of the theory of abstracts sets. 128 

Cantor (1895 p. 481) defined a set thus, “By a ‘set’ we understand any 
collection M of definite well-distinguished objects m of our perception 
or our thought (which are called the ‘elements’ of M ) into a whole.” We 
write “m e M" to say that m is an element of M, or synonymously that m 
is a member of M or belongs to M; and “m £ M ” to say that m does not 
belong to 3/. 129 Two sets A/phnd A/ z are the same (M x = M 2 ) if they have 
the same members. A finite set can be described by listing its members (the 
order being immaterial) in curly brackets; using dots, we can even suggest 
thus an infinite set. For example, {1, 2, 3} is a set with three elements, and 
(0, 1, 2, . . .} is the set of the natural numbers. We say a set A/, is a 
subset of a set M, and write M x £ M (or M 2 M x ), if each member of 
M x is a member of M. For example, the three-element set {I, 2, 3} has eight 
subsets 

{},{!}. {2}, {3}, {1,2}, {1,3}, {2, 3}, {1,2, 3}. 


The first { } is the empty (or vacuous ) set (often written 0), the next three 
are unit (i.e. one-element) sets, and the last is the improper subset of 
{1,2, 3}. 


’** A proof (to follow § 34) is given in IM pp. 16, 17. 

117 Cantor also developed a theory of point sets, much of which has now become 
familiar in the form of point set topology. 

128 Cantor 1895-7 is quite readable, and has been translated into English (cf. our 
bibliography). Fraenkel 1961 gives an excellent and very full account. A concise treat- 
ment is in Bachmann 1955. 

121 We arc repeating a little from § 26 above to make the present chapter largely 
self-contained. 8 ’' 91 ' 22 
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The cardinal number M is a notion which we “abstract” from M and the 
other sets which can be put into 1-1 correspondence with M. Thus a child 
gets his concept of “two” by abstracting from two parents, two ears, two 
apples, two kittens, etc. What “two” itself is may not worry him. Cantor 
wrote, “The general concept which with the aid of our active intelligence 
results from a set M, when we abstract from the nature of its various 
elements and from the order of their being given, we call the ‘power’ or 
‘cardinal number’ of M.” This double abstraction suggests his notation 
“A/” for the cardinal of M. Frege 1884 and Russell 1902 defined M as the 
set of all the sets N which can be put into 1-1 correspondence with M; 
this gives a place to the cardinals themselves as objects of a universe whose 
only members are sets. To put this in terms that were elaborated in § 30, let 
us say M is equivalent to N, and write M ~ N, if M can be put into 1-1 
correspondence with N. Then ~ is an equivalence relation (i.e. it is 
symmetric, reflexive and transitive); and M is the equivalence class 
to which M belongs under ~ within a universe of sets (cf. (B) in § 30). 

Whatever one's ontology of the cardinals, M = N if and only if M ~ N. 

For any set M, we write “2 M ” as a notation for the set of all the 
subsets of M. 

In our present notation, our last result in § 33 is that 2 M is uncountable, 
or 2 V jt M, when M — {0, 1, 2, . . .}. The reasoning applies similarly to 

any set M to show that M 2 M . Let us repeat the reasoning for the 
case M — {1,2, 3}. Given any set M x c 2 J/ which is in 1-1 correspondence 
with M itself, the diagonal method leads us to a subset of M (i.e. a member 
of 2 W ) not belonging to M v Thus, if M x — {{2}, (2, 3}, {1, 2}}, the matrix 
is as follows: • 

1 2 3 

{2} 

{2, 3} 

{ 1 , 2 } | 0 0 1 

Interchanging 0 and 1 on the diagonal, we are led to {1, 3}, which is a 
member of 2 ,/ (2 1/ has the eight members displayed above) but not of 
M x . If instead A/, = {{1,2,3}, {1}, {3}}, we get {2}, which again is not 

in M x \ etc. Of course, since 2 W has eight members, i.e. 2 M = 8, while 

M = 3, we can say we already knew that M ^ 2 M for M — { 1,2, 3}, as 
part of what we are taking for granted about the natural number sequence. 
However, this and the concluding example in § 33 with M — {0, 1,2,.. .} 
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arc two illustrations of the general proof that, for each set Af, M # 2‘ w . 

This result can be given a stronger form. First, we define M < N (or 
N > M) to hold exactly if M is equivalent to some subset of N but N is 
equivalent to no subset of M (i.e. if there is a set such that M ~ JVj £ N 
but there is no set M x such that M x £ A/). Here it is necessary to 
verify that the result is independent of which sets M and N with the 
respective cardinals M and N we use, or in the terminology of § 30 of 
which^ members M and N we pick from the two equivalence classes M 
and N (Exercise 34.1). We see almost at once that the order relation < 
between cardinals is irreflexive (i.e. M <£ M) and transitive (i.e. if M < N 
and N < P, then M < P) (Exercise 34.2). 

As we already remarked in § 32 assuming familiarity with the natural 
number sequence, the natural number n is the cardinal of the initial 
segment 0, ...,«— 1 of the natural numbers. Consequently, Cantor’s 
definition of the order relation < between any two cardinals applies in 
particular to the natural numbers as cardinals. It can be shown (though 
not without some trouble) that this order relation < between the natural 
numbers as finite cardinals coincides with the familiar one which we have 
been taking for granted. 130 

By only slight refinements of the argument given above, we can 

establish Cantor’s theorem: (C) For each set M, M < 2 M , 

Another easily proved theorem concerns the union U M of a set M 
whose members are sets. KJM has as its members each of the objects 
which is a member of a member of M. For example, if M = {{2}, {2, 3}, 
(2, 5}}, then V>M = {2, 3, 5}. The theorem is: (D) If M is a set of 
sets containing none of greatest cardinal (i.e. to each member A of A/ there 

is a member A' of M with A' > A), then A < \JM for each member A of 
M. 

We adopt the notation N 0 (read “aleph null” or “aleph zero”) for the 
cardinal of the set of natural numbers. Then for each natural number n, 

n < X 0 . This follows from (D) with n = {0, 1, . . . , n— 1} and the fact 
that the cardinal order of the natural numbers is also their usual order. 

Also we write 2' u as ”2- w ”. For M finite, this is consistent with the usual 

arithmetic; e.g. we saw that 2 W = 8 = 2 3 when M = 3. 

The foregoing results give the existence of the following ascending 
series of cardinals 

0 < 1 < 2 < . . . < X 0 < 2 N ° < 2 2X “ < . . . , 

130 A treatment is given in IM pp. 12, 13 with Example 1 §7 p. 22. 
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while by (D) there is a still greater cardinal after all of these, and so on 
indefinitely. Thus, by applying the idea of comparing sets by one-to-one 
correspondences. Cantor discovered that there is not simply one infinity, 
but a whole hierarchy of different infinite (oiy “transfinite”) cardinals. 

lixi iu isr.s. 34.1. Justify the definition of “M < N" by showing that, 
if \1 — \l' and A' — - N', then (a) holds if and only if (a') holds: 

(a) For some N x , M ~ — (V; but for no N ~ A/, £ M. 

(a ) For some Ay, M' ~ A/, £ N' \ but for no M u £ A/'. 

34.2. Show that the relation M < N is irreflexive and transitive. 

34.3*. Prove (C) and (D). 

34.4. (a) What is the cardinal of the set in Exercise 33.1 (c)? (b) What 
is the cardinal of the set of the one-place logical functions when the 
domain is the sets of natural numbers? 

§ 35. The paradoxes. The relation between Cantor’s set theory and 
mathematics was like the course of true love; it never did run smooth. 

Cantor’s set theory deals with “actual” or “completed” infinities. At 
the beginning there was great resistance to this by the/nathematical public, 
stemming in part from the famous dictum of Gauss (1831): “I protest . . . 
against the use of an infinite magnitude as something completed, which is 
never permissible in mathematics: one has in mind limits which certain 
ratios approach as closely as desirable while other ratios may increase 
indefinitely.” (Werke VIII p. 216.) Gauss had in mind infinite magnitudes, 
while Cantor's theory employs infinite collections. 

Just when Cantor's ideas were well on the way toward winning accept- 
ance from most mathematicians, in the I890’s contradictions appeared in 
the upper reaches of his set theory. Nevertheless, since then set theory 
(suitably adapted) has increased its place in mathematics, while the para- 
doxes have focussed attention on the foundations of set theory and of 
mathematics generally. 

The Burali-Forti paradox which appeared in 1897 (but was known to 
Cantor in 1895) arises in Cantor’s theory of ordinal numbers, which we 
have not discussed. 183 

Russell’s paradox 1902a concerns the set of all sets which do not con- 
tain themselves. Call this set S. Suppose (a) S contains itself. Then by the 
definition of S, S does not contain itself. So by reductio ad absurdum 
(rejecting the supposition (a)), we have proved', (b) S does not contain 
itself. But then by the definition of 5: (c) 5 does contain itself. Together 

(b) and (c) constitute a proved contradiction, or paradox. 

Russell in 1919 gave the following popularized version: The barber in 
a certain village shaves all and only those persons in the village who do 
not shave themselves. Question: Does he shave himself? 
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We now give in detail a third paradox of the theory of sets, the Cantor 
paradox (found by him in 1899). Let T be the set of all sets. Now 2 T is a 
set of sets, and hence 2 T £ T. By the definition of < for cardinals (§ 34), 

if M £ TV then M > N. (Why?) So 2 r > T. But by Cantor’s theorem (C), 

2 r > T. Thus we have a contradiction. 

One may try to dismiss this by saying that the set T of all sets does not 

constitute a set. Then in Cantor’s theorem “For each set M, M < 2 M ", 
what is the range of the variable M (what we called the domain D in 
Chapter II)? 

Of a somewhat different type is the Richard paradox (Jules Richard, 
1905), which runs as follows. 

By a “phrase” we shall mean any finite sequence each of whose members 
is either a blank or one of the twenty-six letters of our alphabet (with a 
blank not first or last). Thus, “abracadabra”, “of cabbages and kings”, 
“the square of a”, and “xtu rlbp” are phrases. We can enumerate the 
phrases by the method of digits (§ 32), using a 27-digit or 28-digit number 
system to represent the natural numbers. Certain phrases, such as our 
example “the square of a”, describe in the English language one-place 
number-theoretic functions. We now strike out from our enumeration of 
all the phrases those which do not describe such functions; thereby we 
obtain an enumeration P 0 , P*, . . . of the phrases which do. Say the 

functions described are ffa),ffa),ffa) respectively. 

Now consider the following phrase: “the function whose value for each 
natural number a is obtained by adding one to the value for a of the 
function described by the phrase corresponding to a in our enumeration 
of the phrases which describe one place numbertheoretic functions”. In 
this phrase, we could replace the last part “in our enumeration of the 
phrases which describe one place numbertheoretic functions” by a detailed 
description of the exact construction of the enumeration, and so obtain 
from the whole another phrase P fully describing the same function. 

This phrase P describes a number-theoretic function, namely 

/(«) = /»(«) + 1. 

Hence P occurs in the enumeration P 0 , P lf P 2 , . . .. This is impossible, since 
the function described by P differs from that described by P 0 in its value 
for a = 0, from that described by Pj in its value for a = 1, from that 
described by P 2 in its value for a = 2, and so on. Otherwise expressed: 
Since the phrase P occurs in the enumeration P 0 , P 1( P 2 , .... it is P„ for 
some p. Then 


m =/,(«). 
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A contradiction arises by substituting p for a in this and the preceding 
equation. (In Richard’s original version, real numbers were used instead 
of one-place number-theoretic functions.) 

This paradox is closely connected with the facts that, on the one hand, 
only a countable infinity of number-theoretic functions are describable in 
a given language (because the set of all the phrases in the language is only 
countably infinite, § 32), while, on the other hand, the set of all the number- 
theoretic functions is uncountable (by Cantor’s diagonal method, § 33). 

A similar paradox is due to G, G. Berry (in Russell 1906 p. 645). Con- 
sider the expression “the least natural number not nameable in fewer than 
twenty-two syllables”. This expression names a definite natural number, 
say n, since each nonempty set of natural numbers (in this case, the set 
of natural numbers not nameable in fewer than twenty-two syllables) has 
a least element. By its definition, n is not nameable in fewer than twenty- 
two syllables. But our expression naming n has in fact exactly twenty-one 
syllables! 

These modern paradoxes are related to the paradox of “The Liar”, 
which comes from antiquity. 131 The following statement is attributed to 
the Cretan philosopher Epimenides, sixth century b.c. “All Cretans are 
liars”. Let us suppose that by “liar” Epimenides meant a person who 
never tells the truth. 

Suppose his statement is true; then by what it says and by his being a 
Cretan, it is false, which is a contradiction. So by reductio ad absurdum, 
the statement is not true, i.e. it is false. This means that at some time some 
Cretan has told the truth or eventually some Cretan will tell the truth. 
This, however, should be a matter for the historian to decide; it should not 
be demonstrable on logical grounds only, as we appear to have demon- 
strated it. 

The direct form of the paradox of “The Liar” was given by Eubulides 
in the fourth century b.c. We can give it as follows: “The statement I am 
now making is a lie.” One sees directly that the quoted statement cannot 
be true and that it cannot be false. 

In the ancient “dilemma of the crocodile”, a crocodile has stolen a 
child, but offers to return the child to its father, if the father can guess 
whether or not the crocodile will return the child. If the father guesses 
that the crocodile will not return the child, the crocodile is in a dilemma. 

A missionary, fallen among cannibals, discovers that he is about to 
become their supper. They offer him the opportunity to make a statement, 
under the conditions that, if the statement is true, he will be boiled, and 
if it is false, he will be roasted. What should the missionary say? 

Cantor’s set theory, as historically it first arose and as we met it in 

131 For some historical details arid references, see Wcyl 1949 p. 228 Footnote 2. 
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§§ 32-34, is called “naive” set theory. In using Cantor’s “definition” of a 
set (§ 34), Cantor and we were guided only by our imagination in deciding 
which objects are sets. 

Cantor’s and the other paradoxes of set theory show the difficulties 
inherent in the attempts to develop the theory on an intuitive basis starting 
from Cantor’s definition of set. These difficulties pose the problem how 
to modify set theory so that contradictions do not arise. In fact, the problem 
goes further, and forces us to ask ourselves wherein we were deceived by 
methods of constructing and reasoning about objects which had seemed 
convincing before they were found to eventuate in paradoxes. Complete 
agreement among mathematicians on the cause of the paradoxes and the 
cure has not been attained yet (1967), and it seems problematical that it 
ever will. 

In the remainder of this section, we indicate briefly the least radical 
kind of reformulation of mathematics toward avoiding such paradoxes as 
the Burali-Forti, Cantor’s and Russell’s. (Ramsey 1926 classified the known 
paradoxes into two sorts, one now called “logical” including the three just 
mentioned, the other “epistemological” or “semantical” including 
Richard’s, Berry’s and “The Liar”.) 

This reformulation of mathematics begins with the observation that the 
paradoxes of set theory (the logical paradoxes) are associated with using 
“too large” sets, such as the set T of all sets in Cantor’s paradox. Since 
free use of our conceptions starting with Cantor’s definition of set led 
to the difficulty, Zermelo proposed in 1908 to restrict the sets to those 
provided by a list of axioms. These axioms are drawn up so that there is 
no apparent means to derive the known paradoxes from them. On the 
other hand, the axioms do suffice for the deduction of the usual body of 
classical mathematics, including abstract set theory short of the paradoxes. 

We give now, in our words, the list of seven axioms or principles which 
appear in Fraenkel 1961 with the pages where they appear. 132 (It is not 
intended that the student, so far as this course is concerned, should learn 
these axioms. We give them here to exemplify what sort of axioms are 
used.) Another axiomatic treatment is in Bernays and Fraenkel 1958. 

(I) (Axiom of extensionality, p. 14.) Two sets A and B are equal if (and 
only if) they contain the same members; i.e. A — B ~ (/4 £ B and B £ A). 

(II) (Axiom of subsets, p. 16.) Given a set A and a predicate P(x) 
meaningful for the members of A (i.e., for each x e A, either P(x) is true 


132 In the 1953 edition, the pages for (I)-(VII) are 21, 22, 24, 28, 42, 97, 123, but (VIII) 
does not appear. In Fraenkel and Bar-Hii!el 1958, (I)-(VIl) are reformulated and 
rearranged, and both (VIII) and (IX) (as follows) appear. 

(IX) (Axiom of foundation, or of grounding.) Every nonempty set A contains a mem- 
ber b such that A and b have no common member. 
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or P(x) is false), there exists the set x[x e A & P(x)] containing exactly 
those members of A for which P(x) is true. (This axiom is also called the 
"axiom of selection, or of segregation” or the “Aussonderungsaxiom”.) 

(HI) (Axiom of pairing, p. 18.) If a and b are different objects, there 
exists the set { a , b } containing exactly a and b. 

(IV) (Axiom of union, p. 20.) Given a set 5 of sets, there exists the set 
US containing just the members of the members of S. 

(V) (Axiom of infinity, p. 32.) There exists at least one infinite set: the 
set {0, 1, 2, . . .} of the natural numbers. (Fraenkel used {1, 2, 3, . . .}.) 

(VI) (Axiom of the power set, p. 72.) Given a set A, there exists the 
set 2 A whose members are all the subsets of A. 

(VII) (Axiom of choice, p. 90.) Given a disjointed set S of nonempty 
sets, there exists a set C which has as its members one and only one element 
from each member of S. ( S is disjointed if no two distinct members of S 
have an element is common.) 

(VIII) (Axiom of substitution, or of replacement, p. 199.) Given a set A 
and a single-valued function/ defined on A, there exists the set containing 
just the objects f{x) for x e A. 

A form of the axiom of choice was first explicitly noted as an assumption 
in Zermelo’s proof of his “well-ordering theorem” 1904, 1908a, from which 
it follows that each two cardinal numbers A and B are comparable (i.e. 
that either A < B or A = B or A > B). The form given here, which is 
Russell’s “multiplicative axiom” 1906a, suffices for the derivation of 
Zermelo’s form (and vice versa). 

The axiom of choice has been the subject of much research with a view 
to minimizing its use, singling out its consequences, or (Godel 1938, 1939, 
1940) defending it as an assumption which can be added to the other 
axioms of set theory without entailing a contradiction if the other axioms 
by themselves lead to no contradiction. 133 In 1963-4-Cohen showed that 
instead the negation of the axiom of choice can be added consistently 
(detailed exposition in 1966). Another treatment is in Scott 1966. 

At one point these axioms (as did the axioms given by Zermelo in 1908a) 
lack definiteness. This is in (II), where the notion of a predicate P(x ) 
meaningful for elements x e A is incorporated. This lack of definiteness 
was first remedied by Fraenkel 1922 and somewhat differently by Skolem 
1922-3. What is required is a specification of a class of admissible predi- 
cates P(x). In Skolem’s method, the rules for constructing the P(x)'s are 
formulated simply in the process of specifying the symbolism of the lan- 
guage in which the axioms are stated. 

The specification of the symbolism of a language, which is necessary for 
the purpose of being exact in logical deduction, must really be presupposed 
for the rigorous treatment of logic as in Chapters 1— III above. The seman- 

133 Godel 1947 provides a very readable exposition. 
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tical paradoxes (Richard’s, Berry’s, “The Liar”) show that care is necessary 
in this. That is, the language of a mathematical theory must be subjected 
to rules governing the formation of propositions somewhat akin to the 
rules listed above governing the existence of sets. We shall deal further 
with this. 

In some systems of axiomatic set theory (particularly Godel’s, 1940) two 
sorts of collections are considered explicitly, collections called “sets” 
which can not only possess members but also be members of collections, 
and others called “classes” which may not be taken as members of collec- 
tions. Each “set” is a “class”. The collection of all “sets” constitutes a 
“class”; but we are stopped from obtaining the Cantor paradox because 
this “class” is not a “set”. 

In the definition of validity in the predicate calculus (§ 17), we simply 
said the domain D is to be a “nonempty set or collection”. That was 
before we were ready to talk about a distinction between “sets” and 
“classes” in the present sense. We can get a little more leeway now for 
model theory by allowing D to be a nonempty “class”. For in the notion 
of validity, we do not need to take £> as a member of anything. The dif- 
ficulties which ensue when “too large” collections are treated as members 
are not involved in just using a collection as the range of variables. This 
gives an answer to the question raised above of what the domain D can be 
for set theory itself (where we asked about the range of M in Cantor’s 
theorem, M < 2 U ). (Except when we explicitly cite this passage, “class” 
will continue in this book to be a synonym for “set” as in ordinary usage.) 

§ 36. Axiomatic thinking vs. intuitive thinking in mathematics. 
Partly in connection with the broader aspects of the problem posed by the 
paradoxes (§ 35), we inquire now into the nature of mathematics and the 
scope of mathematical methods. 

The axiomatic-deductive method in mathematics is known to us from 
Euclid’s “Elements” (c. 330-320 B.c.), although there is a tradition that 
credits Pythagoras (sixth century b.c.) with the introduction of the method. 
By use of it, the body of geometrical knowledge was systematized. Euclid’s 
axiomatic system may be described roughly thus: “definitions” of certain 
primitive terms , such as “point”, “line”! “plane” are given, which are 
intended to suggest to the reader what is meant by those terms; certain 
propositions concerning the primitive terms, felt to be acceptable as 
immediately true on the basis of the meanings suggested by the definitions, 
are taken as axioms or postulates’, then other terms are defined in terms of 
the primitive ones; and other propositions, called theorems, are deduced 
by logic from the axioms. Axiomatics such as Euclid’s, in which meanings 
are given to the primitive terms initially, is called material axiomatics. 

One of Euclid’s postulates seemed less evident than the rest, the fifth 
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postulate or “parallel postulate”. 134 He used this in proving the theorem 
that, through a given point P not on a given line /, exactly one line can 
be drawn parallel to / (i.e. not meeting / in a point). Efforts were made 
from Euclid’s time on to prove this postulate from the others as a theorem. 
We now know these efforts could not succeed. 

For, Lobatchevsky in 1829 and Bolyai in 1833 worked out a system of 
geometry in which, through a given point P not on a given line /, infinitely 
many lines parallel to / can be drawn. It is apparent that the meanings of 
Euclid’s primitive terms in terms of physical space do not enable one to 
’ decide whether Euclid's parallel postulate is true or the contrary postulate 
■of Lobatchevsky and Bolyai. The differences in the resulting geometries 
•may be too small to show up in any measurements we can make in the 
[portion of space accessible to us, just as in some other times people have 
(thought the earth flat from the portion of it they could see. 

V So whether a proposition of Euclidean geometry is exactly true must be 
a property of the geometry as a logical system. But if Euclidean geometry 
is a valid logical structure, so is the Lobatchevskian geometry. For, as 
Felix Klein pointed out in 1871, the axioms of the plane Lobatchevskian 
geometry are all true when the primitive terms in them are reinterpreted 
so that “plane” is taken to mean the interior of a given circle in the 
Euclidean plane, “point” means a point inside this circle, “line” means a 
chord of this circle, and distances and angles are computed by formulas 
due to Cayley 1859. (Another such Euclidean model, applicable to a 
bounded portion of the non-Euclidean plane, was given in 1868 by 
Beltrami, who reinterpreted line segments as segments of shortest paths 
between points, or “geodesics”, on a “surface of constant negative 
curvature”.) 

In these models, we may observe that something new has been done with 
the axioms, not to be found in the earlier axiomatic thinking: the meanings 
of the primitive terms have been varied, holding the deductive structure of 
the theory fixed. Thus formal axiomatics arose, in which the meanings 
of the primitive terms, instead of being specified in advance, are left 
unspecified for the deductions of the theorems from the axioms. One is 
then free to choose the meanings of the primitive terms in any way that 
makes the axioms true. We have been representing this standpoint in our 
definition of “valid consequence” in model theory (§§ 7, 20). Especially 
in modern algebra, it has proved very fruitful to develop the consequences 
of systems of axioms regarded formally, such as the axioms of abstract 

131 It reads: If two straight lines in a plane meet another straight line in the plane so 
that the sum of the interior angles on the same side of the latter straight line is less than 
two right angles, then the two straight lines will meet on that side of the latter straight 
line. 
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group theory (cf. §39). The results deduced from the axioms of group 
theory, while leaving unspecified the set of elements and the multiplication 
operation, constitute a body of theory ready-made for diverse appli- 
cations. 

In formal axiomatics, the system of axioms may be investigated for such 
properties as the independence of one axiom from the others (by seeking 
an interpretation of the primitive terms which makes that axiom false and 
the others true), categoricity (i.e. that the elements in any two interpreta- 
tions can be put into 1-1 correspondence preserving all properties), etc. 135 

In this approach to axiomatics some questions arise. Why do we choose 
the axioms we do, and why should the resulting systems interest us? The 
answer is evidently that we may apply the resulting theory to systems of 
objects provided from outside the axioms by an interpretation of the primi- 
tive terms. Sometimes essentially different interpretations are possible (the 
axioms are then not categorical, but ambiguous)-, e.g. this is the case for 
the axioms for abstract groups. We should not wish to employ a system 
of .axioms satisfied under no interpretation; such a system we call vacuous. 
One of the problems in formal axiomatics is to show axiom systems non- 
vacuous. However, a system of objects used as an interpretation is often 
drawn from some other axiomatic theory; then we have a regress, which 
merely brings us to the question of the significance of that axiomatic theory 
instead. If at no stage is an application made outside of formal axiomatics, 
the whole activity must appear to be futile. We therefore conclude that, 
if we are not to adopt a mathematical nihilism, formally axiomatized 
mathematics must not be the whole of mathematics. At some place there 
must be meaning, truth and falsity. At the very least, when we say that 
in a given formal axiomatic theory a certain proposition is a theorem, we 
must believe this is true, i.e. that the proposition does follow from the 
axioms, though whether the proposition itself is true is being left out of 
account, since in formal axiomatics the deductions are carried out in 
advance of assigning meanings to the primitive terms (or disregarding any 
such assignment). 

As a further illustration of a mathematical proposition which is not 
intended to be asserted merely as a formal but meaningless consequence 
of axioms, consider the theorem (proved in number theory) that, for given 
integers a, b, c, we can find out whether or not integers x and y exist such 
that ax -(- by + c = 0, i.e. the theorem that there is a method of deciding 
whether or not the equation ax + by ■+■ c = 0 (a, b, c integers) is solvable 
in integers. Although the theory of the integers may have been established 

135 This kind of treatment of axiomatic systems is well presented in J. W. Young 
1911. Several of the topics mentioned briefly now will be elaborated below: independence 
proofs in § 57; categoricity in § 53; consistency proofs by interpretation in § 52. 
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axiomatically, this proof is intended to mean that, for any particular a, b, c, 
we can discover whether or not there are solutions. A student who could 
merely give a proof from axioms of the theorem that one can find out 
whether there are solutions or not, but could not do a problem. in which he 
found out, would not have acquired what the teacher intended to teach. 
Nevertheless, he would be doing all that should be asked of him if the 
theorem (that one can find out) were intended only in the sense of formal 
axiomatics. 

As the least drastic method of meeting the situation posed by the para- 
doxes, we described axiomatic set theory at the end of § 35. Here the 
axiomatics is to be understood in the formal sense, unless one is to try to 
retain an intuitive conception of sets, which it was presumed was exactly 
what the axioms were intended to supplant. However the present con- 
siderations show that the resort to a formal axiomatic theory, though it 
may offer considerable advantages, leaves open such problems as why the 
axioms are significant and whether or not they apply to any system of 
objects not merely similarly postulated as existing for some other formal 
axiomatic theory. 

Hilbert undertook to deal with these problems. He admitted that classical 
mathematics (i.e. the familiar mathematics using classical logic) contains 
much that goes beyond what is clearly meaningful and justifiable on intui- 
tive grounds, as indeed mathematicians generally were made to realize 
when in set theory they went too far and encountered paradoxes. But he 
proposed to save classical mathematics (short of paradoxes) by a program 
which we can roughly describe as follows. Classical mathematics should 
be formulated as a formal axiomatic theory, and then the theory should 
be shown to be consistent, i.e. free from contradiction. 

Before this proposal of Hilbert, first made in 1904, but not seriously 
undertaken by him and his co-workers until after 1920, consistency proofs 
had been given for formal axiomatic theories by means of a model or 
interpretation, in which all the axioms are found to be true when the 
primitive terms are interpreted in terms of another theory. We saw an 
example of this above, by which the non-Euclidean plane geometry of 
Lobatchevsky is shown to be consistent if Euclidean geometry is consist- 
ent. 135 In each case, a proof of consistency by a model only shows the 
theory consistent if another is. By Rene Descartes’ method of analytic 
geometry 1619, the consist ency of geometries generalJyjsJe<iU£ed_to that 
of the theory ofjeal members, i.e. to analysis. But"Kow is one to establish 
the consistency of analysis? Certainly not by using a geometrical model; 
this would be a vicious circle. Nor, according to Hilbert and Bernays 1934, 
by appeal to the physical world. For limitations of our measurements in 
the physical world prevent us from saying that a continuum is actually 
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given by experience; rather it is an idea we obtain by extrapolating or 
idealizing what is actually given. 130 

So Hilbert’s proposal to prove classical mathematics as embodied in a 
formal system consistent called for a new method in place of the method 
of giving a model. This method consists in a direct application of the idea 
of consistency, namely, that there be no contradiction or paradox consist- 
ing of two theorems, one of which is the negation of the other. To show 
that this cannot happen, Hilbert proposed to make the proofs in the axio- 
matic theory the object of a mathematical investigation, called metamathe- 
matics or proof theory. Of course, such a demonstration of consistency 
would be relative to the methods used in the metamathematics. Hilbert 
therefore aimed to use in his metamathematics only methods, which we 
call “finitary”, that are intuitively convincing. Specifically, these methods 
should avoid using an “actual” or “completed” infinity. Hilbert’s new 
approach avoids the completed infinite in the statement of the problem 
of proving consistency. For there is only a countable infinity of proofs in 
a given theory, and the consistency proposition only concerns any pair of 
proofs, not the set of all proofs as a completed object. (What the theory 
is supposed to be about may be much less elementary.) So it seemed not 
unreasonable to hope that the problem of consistency, now that it was 
stated in finitary terms, might be solved by finitary methods. 

In §§ 37-39 we shall take a closer look at how parts of mathematics can 
be made into formal axiomatic theories and studied in metamathematics. 

Brouwer was the champion of intuitive thinking in mathematics, as 
Hilbert was of axiomatic thinking. Brouwer’s and Hilbert’s approaches 
may also be called “genetic” (or “constructive”) and “existential”, 
respectively. 

According to Weyl 1946, “Brouwer made it clear, as I think beyond any 
doubt, that there is no evidence supporting the belief in the existential 

character of the totality of all natural numbers The sequence of 

numbers which grows beyond any stage already reached by passing to the 
next number, is a manifold of possibilities open towards infinity; it remains 
forever in the status of creation, but is not a closed realm of things existing 
in themselves.” 

While Hilbert proposed to shore up the structure of classical mathe- 
matics by a consistency proof, Brouwer was ready to abandon those parts 
of mathematics in which mathematicians had been carried away by words 
that had outrun clear meanings. Brouwer proposed instead to develop an 
“intuitionistic” mathematics, which would go only as far as intuition 
would lead it. For Brouwer, the systems of objects for mathematics should 

m Hilbert and Bernays 1934 pp. 15-17; quoted on IM pp. 54-55. 
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be generated by some principles of construction, not brought into existence 
all at once as sets satisfying a list of axioms. 

Since Brouwer took only the “uncompleted” or “potential” infinite as 
intuitive, he declined to accept logical principles which require for their 
justification a conception of infinite sets as completed. So, in a paper 
entitled “The untrustworthiness of the principles of logic” 1908, he chal- 
lenged the assumption that the laws of classical logic have an absolute 
validity, independent of the subject matter to which they are applied. 
Particularly, he criticized the law of the excluded middle, P V ~i P. Consider 
a predicate P(.v) where x ranges over some set D. Applied to 3xP(x) as the 
P, the law says that either there is an x in D such' that P{x) or there is no 
x in D such that P(x) ; in symbols: 3 xP(x) V “ i3xP(x). In case D is a finite 
set (and P(x) is a predicate such that, for each value x in D, we can test 
whether P(x) holds or not), Brouwer finds 3 xP(x) V SxP(x) true. For, 
we can find out whether 3 xP(x) or ~i3 xP(x) by testing, for each member 
x of D in turn, whether or not P(x) holds for that x. Since D is finite, this 
process will (in principle) terminate. But if D is an infinite set, say a count- 
able one, such as the set of the natural numbers, the testing process cannot 
humanly be completed. If we are lucky, we may part way through the 
testing find an x such that P(x). But if there is no such x, or if such an x 
comes too late and doomsday comes too soon, we could search till dooms- 
day and still not have an answer to our question. Accordingly Brouwer 
finds no ground for taking 3 xP(x) V ~i3 xP(x) to be always true, when D is 
infinite. Quoting from Weyl 1946, “According to his view and reading of 
history, classical logic was abstracted from the mathematics of finite sets 
and their subsets. . . . Forgetful of this limited origin, one afterwards mis- 
took that logic for something above and prior to all mathematics, and 
finally applied it, without justification, to the mathematics of infinite sets.” 

Brouwer’s path, like Hilbert’s as we shall see, was beset with difficulties. 
An intuitionistic mathematics was developed (beginning in 1918), which 
in part falls short of classical mathematics in the results obtained, in part 
takes a different direction. In parts common to classical and intuitionistic 
mathematics, the intuitionistic (or “constructive”) proofs, though often 
harder, often give more information. To prove an existence statement 
3xA(x), an intuitionist insists that it be shown how to find an x such that 
A(x). An “indirect proof” by showing that the assumption —t3xA(x) leads 
to contradiction is not accepted by him as showing that 3 xA(x); it estab- 
lishes only — l — 13 xA(x). 137 

Now we touch on the controversy between Brouwer and Hilbert. 

137 we g ave a little fuller sketch in IM § 13. Excellent introductions are in Heyting 
1934, 1955 and 1956. Kleene and Vesley 1965 is intended for readers familiar with IM 
Chapters I— XII (or as a minimum IV-VHI) or the equivalent. 
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Brouwer argued that, even if Hilbert should succeed in giving a consistency 
proof for classical mathematics, that would not make classical mathematics 
correct. Thus he wrote in 1923, “An incorrect theory which is not stopped 
by a contradiction is none the less incorrect, just as a criminal policy un- 
checked by a reprimanding court is none the less criminal.” Hilbert re- 
torted in 1928, “To take the law of the excluded middle away from the 
mathematician would be like denying the astronomer the telescope or the 
boxer the use of his fists.” This controversy between the “formalists”, 
represented by Hilbert, and the “intuitionists”, represented by Brouwer, 
led eventually to the acknowledgement by the intuitionists that Hilbert’s 
program would be unobjectionable if and only if the formalists refrain 
from taking a consistency proof as justification for attaching a real meaning 
to those parts of mathematics which the intuitionists reject as having no 
intuitive basis (Brouwer 1928). 

It remains then for the formalist to explain, after having admitted that 
classical mathematics goes beyond intuitive evidence, how nevertheless its 
nonintuitionistic parts can be of value. Addressing himself to this problem, 
Hilbert 1926, 1928 drew a distinction between real statements which have 
an intuitive meaning, and ideal statements (involving the completed infi- 
nite) which do not. It is a common device of modern mathematics to adjoin 
“ideal elements” to a previously constituted system in order to achieve 
theoretical objectives, such as to simplify the theorems, comprehend them 
under a more unified viewpoint, etc. An example occurs in projective geo- 
metry, where a line at infinity is adjoined to the finite part of the plane so 
that any two (distinct) parallel lines intersect in a point of that line. Thereby 
the exception for parallel lines to the “incidence relations” between points 
and lines is removed. Thus, in projective geometry, not only do each two 
distinct points contain a unique line (which passes through both), but dually 
each two distinct lines contain a unique point (in which they intersect). 
Hilbert argued that just this kind of theoretical gain is achieved by adjoining 
the ideal statements to the real statements in classical mathematics; it is 
through this procedure that classical mathematics achieves its power and 
elegance. 

In this way, mathematics becomes a theoretical construction in which, 
Hilbert says, it should not be expected that each separate statement should 
have a real meaning, any more than that each proposition in a system of 
theoretical physics should be capable of immediate experimental verifica- 
tion; in the latter case, it is the theory as a whole that is tested against reality. 

A concrete example of the theoretical gain obtained by going through 
ideal statements in the process of proving real statements is provided by 
analytic number theory, in which theorems about integers are proved via 
the theory of real or complex numbers. Many propositions of elementary 
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number theory have been proved thus, which either we do not know how 
to prove, or can establish only by much more complicated proofs, if only 
nonanalytic methods are used, 

Closely related to this defense of classical mathematics as a simple and 
elegant systematizing scheme is the defense provided by its success in 
applications to the theoretical sciences, especially physics. This led Weyl 
1926 to pronounce Hilbert correct when mathematics is merged with 
physics in the process of theoretical world construction, while he sided 
with Brouwer in restricting himself to intuitive truths when mathematics 
is pursued for itself alone. 138 

§ 37. Formal systems, metamathematics. In § 36, we discussed 
formal axiomatics, stressing that the primitive terms are to be treated as 
meaningless for the purpose of deductions from the axioms by logic; i.e. 
either they have been assigned no meanings, or the meanings they have 
been assigned are to be left out of account. If they had meanings which 
have to be taken into account, this would amounUto saying that the theo- 
rems depend not only on the properties of the primitive terms expressed 
by the axioms, but also on further properties which enter through the 
use of the meanings. But then those further properties should be stated 
as additional axioms. 

Euclid in his “Elements” failed to state in his “axioms and postulates” 
all the properties he used. Figures accompany many of his proofs. It had 
long been taken for granted that the figures are inessential to the proofs, 
serving only to make it easier to discover them or to follow them or to 
remember them. But they do in fact sometimes introduce information that 
is used. 

This has been dramatically illustrated by devising “proofs” of “false” 
theorems, such as that every triangle is isosceles, 139 There is nothing dif- 
ferent in these “proofs” from proofs given in Euclid except the use of 
slightly distorted figures. 

The “hidden” assumptions of Euclid have been brought into the light 
in modern times, and stated as axioms by Pasch 1882, Hilbert 1899 and 
others. Thus Hilbert’s axioms include the following, adapted from Pasch: 
If a line in the plane of a triangle not passing through a vertex cuts a side, 
it cuts one or the other of the remaining two sides. Our figures do come out 
this way; but nothing in Euclid’s text enables us to prove that they must. 
Hilbert’s “Grundlagen der Geometrie” 1899 gives an elegant treatment of 
Euclidean geometry from the standpoint of formal axiomatics, with all 
assumptions explicitly stated. 

198 See Weyl 1949 pp. 50-62. 

ln W. W. Rouse Ball 1892 (pp. 80-81 in the 11th ed. 1939). Reproduced in J. W. 
Young 1911 pp. 143-145. 
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Now in formal axiomatics, while the primitive terms are to be meaning- 
less, in carrying out the deductions by logic the meanings of the ordinary 
words are used. However, we have seen that theories may differ in their 
logic as well as in their mathematical assumptions. So, to make it perfectly 
explicit what the theorems of a theory are to be, we should carry out the 
step for all the words which is carried out in formal axiomatics for the 
primitive terms; i.e. we should divest them of meaning for the purpose of 
deductions, and carry out the deductions entirely by stated rules applying 
only to the form of the sentences. The logic used in the deductions in 
formal axiomatics must be represented in part by these rules, but may in 
part also be provided by logical axioms. 

To carry out such a complete formalization would not be practicable 
if the theory were kept in an ordinary language such as English. For, the 
word languages have irregularities and ambiguities which would greatly 
complicate the task. 

Indeed mathematics has in the entire modern period profited greatly by 
using special symbolisms, though it has customarily left parts of its sen- 
tences, including the parts involved in logical deduction, in ordinary 
language. The symbolic equation not only represents a great economy in 
writing, but presents an opportunity for manipulations (such as trans- 
posing : e.g. x + 5 — 2 gives * = 2 — 5 and x ~ —3), which, though justi- 
fied by the meanings, are in practice usually carried out speedily without 
stopping to think through the justifications. This is indeed a semi-formal 
kind of reasoning, which greatly increases the power of modern mathe- 
matics. 

The complete formalization which we now desire, for Hilbert’s and 
other purposes, is obtained by combining the symbolization prevalent in 
modern mathematics with the symbolic treatment of logic available from 
the work of Boole, Peirce, Frege, Whitehead and Russell, and others. 
Using these two ingredients, we construct a completely symbolic language 
for the theory we wish to formalize. We specify both its syntax (by “forma- 
tion rules”) and its logic (by “deductive rules” or “transformation rules”). 28 
The result we call a formal system or formalism or logistic system. This 
method of making a theory explicit is sometimes called the logistic method. 

To discuss a formal system, which includes both defining it (i.e. speci- 
fying its formation and transformation rules) and investigating the result, 
we operate in another theory or language, which we call the metatheory or 
metalanguage or syntax language. In contrast, the formal system is the 
object theory or object language. The study of a formal system, carried out 
in the metalanguage as part of informal mathematics, we call metamathe- 
matics or proof theory. ’ 

For the metalanguage we use ordinary English and operate informally, 
i.e. on the bas is of meanings rather than byjqrmal rules (which would 



200 


THE FOUNDATIONS OF MATHEMATICS 


CH. IV 


require a inelametalanguagejb r their statement and us e). Since in the 
metamathematics English is being applied to the discussion only of the 
symbols, sequences of symbols, etc. of the object language, which consti- 
tute a relatively tangible subject matter, it should be free in this context 
from the kind of lack of clarity that was one of the reasons for formalizing. 

Since a formal system (usually) results by formalizing portions of existing 
informal or semiformal mathematics, its symbols, formulas etc, will have 
meanings or interpretations in terms of that informal or semiformal mathe- 
matics. These meanings together we call the {intended or usual or standard) 
interpretation or interpretations of the formal system. 140 If we were not 
aware of this interpretation, the formal system would be devoid of interest 
for us. But the metamathematics, to accomplish its purpose, must study 
the formal system as just itself, i.e. as simply a system of meaningless 
symbols, and may not take into account its interpretation. When we speak 
about the interpretation, we are not doing metamathematics. 

Furthermore, as we saw in § 36, for Hilbert’s program the methods used 
in metamathematics must be ones, called “Unitary”, which are intuitively 
convincing. 

Now we relate the present terminology to that used in Part I. We do 
not consider a language, taken as an object of study, to be a “formal 
system” unless it is a symbolic language (not a part of a word language, 
like English) and unless axioms and rules of inference are specified for it 
(not simply model-theoretic concepts, like “valid” and “valid conse- 
quence”). We do not refer to a language used in studying an object lan- 
guage as the “metalanguage” or “syntax language” unless in it only 
finitary methods are used (though some authors do). Thus “object lan- 
guage” and “observer’s language” as used in Part I are broader terms than 
“formal system” and “metalanguage”. 

In Part 1, we used “proof theory” a little loosely compared to Hilbert’s 
intended sense, to which we shall now adhere. For, we had not actually 
specified a symbolic language there. 141 

140 If that informal or semiformal mathematics is ambiguous (like abstract group 
theory), admitting different interpretations, then we can use “interpretation” here in 
either the singular or the plural, depending on where the focus of our attention is, on the 
semiformal mathematics as the mathematician works with it while developing it, or on 
its interpretations in turn. 

141 In our "proof theory” in Part I we did use only finitary methods, though we didn’t 
say so. In “modej_theorylU-lhe metho ds are no t restricted to be finitar y.. In modern 
“model theory”, we would likewise work with a completely symbolic language. Just as 
Hilbert is responsible for “proof theory” in its present refinement, Tarski 1933, 1935 
etc. is the originator of much of modern model theory. Carnap 1935 did some work 
similar to Tarski 1933. Sometimes proof theory is called “syntax” and model theory 
“semantics”. For a full bibliography of model theory, see Addison, Henkin and Tarski 
1965. 
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If we supply suitable definitions (referring to a symbolic language) of 
“prime formula” or “atom” for the propositional calculus, and of “prime 
predicate expression” or “ion” for the predicate calculus (or with functions 
§ 28, also of “prime function expression” or “meson”), then using them 
as the basis for the definition of “formula” in § 1 or § 16 or § 29 (or of 
“term” and “formula” in § 28 or § 29), we get formal systems of proposi- 
tional calculus and of predicate calculus without or with equality. 142 (In 
§§ 38, 39 we shall actually do this in a variety of ways.) After this is done, 
the theorems in Part I constitute metamathematical theorems, except those 
involving the validity or “valid consequence” notion for the predicate 
calculus (without or with equality), whose definitions are not finitary. 
(However, Corollary Theorem 12 as extended to the predicate calculus in • 
§§ 23, 28, 29 is metamathematical when proved using 1-N, which is finitary, 
although the extension of Theorem 12 itself is not.) 
i Hilbert’s best-publicized objective, to save classical mathematics short 
of the paradoxes by a consistency proof (§ 36), calls for a formal system 
embracing (elementary) number theory (i.e. the theory of natural numbers, 
or also of similar “systems” of X 0 objects), analysis (i.e. the theory of real 
numbers, etc.), and presumably quite a bit more. However the problems 
encountered by metamathematics in treating number theory proved so 
challenging that Hilbert and his coworkers in the two decades 1920-1940 
gave most of their attention to the metamathematics of a system (or 
several systems) of number theory. We shall describe such a formal system 
N in the next section, and it will serve as an example for much of the work 
in Chapter V. Some other formal systems will be described in § 39. 

Of course, other problems than the consistency problem are of interest 
to metamathematicians; and metamathematics has proved fruitful in a 
variety of ways, not all of them foreseen. We shall see some of its other 
t applications below. 

i Another application is to the description and investigation of “machine 
languages” or “computer languages” for use with modern high-speed 
computers. Information needs to be given to computers in exact form, by 
sequences of symbols on tapes, cards, or otherwise, leaving nothing to the 
imagination. The formation of the sequences of symbols used for this 
purpose must be subject to exact syntactical rules, of the sort which first 
became an object of mathematical study in Hilbert’s metamathematics. 143 

§ 38. Formal number theory. We now describe a specific formal 
system N, which is designed to formalize elementary number theory. We 

114 At the same time, as will appear in § 38, we need to be a little more explicit about 
how parentheses are used. 

1,4 Inversely, computers may be applied in metamathematics to seeking proofs of 
theorems, or to checking proposed proofs, etc. 154 
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first introduce the formal symbols, which structurally play the part of 
letters of the alphabet in our formal language (although for the interpreta- 
tion most of them correspond to words in English). These symbols are: 

=>, V, -i, V, 3, = , + , •, 0, a, 6, c,\ . . , 


The commas, and the three dots near the end, are not formal symbols but 
punctuation marks used in displaying the formal symbols on the page. 

The symbols are “variables”; we need to have a countable 

infinity of them available potentially. Since the Latin alphabet has only 
26 letters, for definiteness we shall assume that the variables consist of the 
26 Latin script small letters, and also any of these followed by one or more 
occurrences of the symbol |, so that rv, ^ ^ \ ] * ^III* etc. 

are variables. 144 The variables other than the 26 script letters are then not 
single formal symbols, but finite sequences of formal symbols. The system 
N then has an alphabet of exactly 41 formal symbols. 

Notice that here a^, a,^, . . . with the letters in script 

are the variables in the object language themselves, not names in the meta- 
language for variables in the object language, as are “a”, “b”, “c”, . . . , 
“x”, “y”, “z”, “Xx”, ‘V, “x 3 ”, . . . with the letters in Roman type (fol- 
lowing usage begun in § 16). That is, here a is just a, whereas x can be 
a or rf or <• or <»,, etc., in different metamathematical statements about a 
variable x. 145 

We call a finite sequence of (occurrences of) formal symbols a formal 
expression. Just as formal symbols correspond structurally to letters in 
ordinary languages, formal expressions correspond structurally to words, 
though for the interpretation some of them will represent entire sentences. 
Most formal expressions, like ))a0= and aaa will not interest us. But we 
shall now define two particular classes of significant formal expressions: 
“terms”, which for the interpretation correspond to nouns, and “for- 
mulas”, which for the interpretation correspond to declarative sentences. 
Each definition consists of several clauses. 

Definition of “term”. 1. 0 is a term. 2. The variables a, 6, c, . . . are 
terms. 3-5. ifTaTKLsTire terms, so are (r)', (r)-f(s), and (r)-(s). 6. The 
only terms are those given by 1-5. 

Here “r” and “s” are not formal symbols, but metamathematical vari- 
ables used in the syntax language to represent formal expressions, in this 

114 If it is more convenient to use "a", • • • than "a”, ''■a.f', • • • . 

we can do so in practice by regarding the former as metamathematical abbreviations for 
the latter. 

114 A point arises here with respect to all the formal symbols including the variables, 
which was discussed in § 1 Footnote 6 for 13, a, V, — i when they are symbols of the 
object language (as here). To get names of the formal symbols in the syntax language, 
we simply use specimens of those symbols as names for themselves (“autonymously”). 
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case any terms already constructed. Thus “(r)+(s)” is not a formal 
expression, but a metamathematical expression which becomes a formal 
expression when “r” and “s” are replaced by terms. 

Examples of terms: 0, a, d, e, a^, (0)', ((0)')+(«), (((0)')+("))00- 

Definition of “formula”. 1. If r and s are terms, then (r)=(s) is a 
formula.^I-6. lf~A "and are formulas , so are (A) ~ (B), (A) 3 (B), 

(A) & (B), (A) V (B) and -i(A). 7-8. If A is a formula and x is a variable, 
then Vx(A) and 3x(A) are formulas. 9. The only formulas are those given 
by 1-8. 148 

As was the case with “r” and “s” in the definition of term, “A” and “B” 
are metamathematical variables representing any formulas, and “x” is a 
metamathematical variable representing any formal variable. Thus 
“Vx(A)” becomes a formula when “x” is replaced by any variable, e.g. a, 
and “A” is replaced by any formula, e.g. («)=(/?), giving Vzr((*)=(^)). 
Using e.g. fi instead, we get a different formula V/f((«) =(/)). This is why 
the metamathematical variable “x” is necessary in Clause 7; had we 
written a instead, we would be allowing only V«((//)=(/f)) (but not 
— 00), etc.) as a formula. 

The definition of “term” here agrees basically with that in § 28 for the 
case of four mesons 0, (— )', (—•)+(—), (—)•(—) which we form using 
the individual symbol (or 0-place function symbol) 0, the I -place function 
symbol ', and the two 2-place function symbols + and •. The definition of 
“formula” then agrees with that in §28 (extending § 16) for the case of 
one ion — = — formed using the 2 -place predicate symbol =. 142 

If a formal expression is given, how can we determine whether or not 
it is a term, or is a formula? Consider the following example. 

(1) (3c((((*)') + (o)) = (^)» => (l((«)=0f))). 

We first observe that each of c, a, 6 are terms. We then proceed outwards, 
using the parentheses as a guide, verifying successively that («)', ((«)')+(") 
are terms, and then that (((«)')+(«))=(^), 3c(((( c ) )+("))=(^))> ( a )—(0> 

lu It is important to remember that the formulas of a formal system are not just any 
formulas of informal mathematics, or any finite sequences of (occurrences of) the formal 
symbols of the system; but they are just those finite sequences of the formal symbols 
which are formed in accordance with the rules defining “formula” (here the nine rules 
just listed). 

To emphasize this, many authors call them “well-formed formulas” or “wffs”. (To 
be consistent, if we did so here, we ought to say also “well-formed terms” or “wfts”, 
and “well-formed proofs” or “wfps”) We find “well-formed formulas” and “wffs” a 
little aneuphonious (unwell-sounding). So we prefer, after the well-taken point has been 
well-emphasized, to say simply “formulas” (following Hilbert and Bernays 1934, 1939). 
For the infrequently needed arbitrary finite sequences of formal symbols, we have the 
longer name “formal expressions”. 
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— 1 (( „)=(/£)), and finally (1), are formulas. In practice in the case of quite 
long formal expressions, prior to thus testing to see whether or not they 
are terms or formulas, we may pair parentheses in the following way. 
First, pair a left parenthesis “(” and right parenthesis “)” which occur, 
the left parenthesis to the left of the right one, with no other parenthesis 
between; this pairing may be indicated by attaching subscripts ,. Then 
repeat the process using subscripts 2 , then 3 , etc., each time taking into 
account only parentheses not already subscripted. The result of carrying 
out this process on (1) is as follows: 

— 1 (n — , (io(8 < *)8 = (#' / ‘)»)io)xi- 

Now by following the order of the subscript pairs, we can carry out the 
verification stage by stage that (1) is a formula. By a proper pairing of In 
parentheses, n of them left and n of them right, we mean a pairing such 
that a left parenthesis is always paired with a right parenthesis to the right 
of it and no two pairs separate each other thus: ( 4 (, ), ),. It can be proved 
that 2n such parentheses admit at most one proper pairing. Also it can 
be proved that in a term or formula there always is a proper pairing of 
the parentheses, by which indeed we always can find out in what order the 
parts were put together under the clauses of the definitions of terms and 
formula. 147 

The reader will have observed that here we call for parentheses to be 
introduced with each of our operators for building terms or formulas 
already constructed into larger terms or formulas. This contrasts with 
§§ I, 16, where we only asked for parentheses to be supplied as required 
to avoid ambiguity about the scopes (§ 1 Example 1). But we shall arrange 
that our practice here will in effect be the same as before. To do so, we 
provide that, as a kind of abbreviation in our metamathematical writing, 
we may omit parentheses to any extent such that we can see how to restore 
them (as in §§ I, 16 we supplied them), using the following ranking of the 
present operators: 

3, &, V, -i, Vx, 3x, =, +, •, '. 

Now (1) can be abbreviated to 3 c(c = —!</,,= /' or even to 

3 f /+«=/ z> — \< r—/f. We shall regard these omissions of parentheses as 
being only in the exposition of the metamathematics, and not as altering 
what a term and a formula strictly are. This keeps the fundamental meta- 
mathematical definitions of “term” and “formula” simpler than if we 
incorporated into them explicit rules for supplying parentheses selectively. 
Of course, here the logistic method requires complete explicitness in these 


117 See IM pp. 23-24, 73-74. 
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fundamental definitions. Similarly, in our exposition we may change some 
parentheses to square or curly brackets to assist the eye in pairing them. 

We also introduce new metamathematical symbols to permit abbrevia- 
tions of terms and formulas. For instance, is an abbreviation for 

— i and for 3 c(c +a=6) or 3 d{d' -\-a=S), etc. Here 

and “<” are symbols used for abbreviation, not formal symbols. In the 
case of the abbreviation there is ambiguity what variable to use 

in the quantifier which we supply in unabbreviating. The general rule shall 
be that “r<s” abbreviates 3x(x'4-r=s), where x can be any variable not 
occurring in r or in s. Under this rule, two legitimate unabbreviations of 
“r<s” will be congruent (§ 16), so it will be immaterial for our usual 
purposes which legitimate unabbreviation we choose. (For, in the first 
place, two congruent formulas have the same meaning under the inter- 
pretation of the symbolism. And in the metamathematics, either will be 
provable if the other is, by Theorem 25 with Corollary 2 Theorem 23 § 24, 
both of which will be available here ; and likewise either can replace the 
other in a deducibility relationship.) Now (1) can be written «</ D a ^ 6. 
Further useful abbreviations are l V>^” for «?<«, “«<<?<>” for 
a <S&/!<c, etc.; and “1” for O' (i.e. for (0)'), “2” for 1' (i.e. for ((0)')'), 
“3” for 2', etc. 

The list of formal symbols and the definitions of “term” and “formula” 
constitute the formation rules of our formal system (analogous to the rules 
of syntax in ordinary grammar). We shall now give the definitions that 
establish the deductive structure of the system (the transformation rules or 
deductive rules). These begin with a list of axiom schemata, particular 
axioms and rules of inference (which we may call collectively the “postu- 
lates”). When we have given this list, then “(formal) proof (of Bj)”, “B is 

provable” or in symbols “ b B”, “(formal) deduction (of B,) from A : 

A m (holding all variables constant)”, “B is (formally) deducible from 
A 1( . . . , A,„ (holding all variables constant)” or in symbols “Aj, . . . , 
A m h B”, etc. are to be defined for N from its postulate list as before for 
the propositional and predicate calculi from their smaller postulate lists 
(§§9,21). 

Here we reemphasize from § 9 that a (formal) proof of B is an object of 
the object language (§§ 1, 37); specifically, it is a suitable finite sequence 
of formulas, which in turn are suitable finite sequences of formal symbols, 
where “suitable” means of the respective kinds defined above. We may 
talk about such proofs, construct them, or “prove” that they exist. Here 
the “prove” in quotes is the ordinary word in English, and means that 
there is a proof in the intuitive sense (an informal proof) in the observer’s 
language, now called the metalanguage. A formal proof is a proof of a 
formula, which (for the metamathematics) is a meaningless sequence of 
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symbols. An informal proof (in metamathematics) is a proof of a meaning- 
ful statement about the meaningless formal objects; and this informal 
proof is an intuitive demonstration of the truth of that statement. Thus a 
“proof that F B” or “a proof of ' F B’ ” is an informal proof of the fact 
that a formal proof of B exists. We might try to use a different word than 
“proof” for informal proofs, but it is not convenient to do so; so we rely 
on the context to make it clear when we mean a formal proof (in the object 
language) and when an informal proof (in the metalanguage). 

To begin with, N shall have all the postulates of the predicate calculus. 
That is, it shall have the three rules of inference, the 3-rule or modus 
ponens (§ 9 and Theorem 3), the V-rule and the 3-rule (§ 21 and Theorem 
16); and it shall have Axiom Schemata la-lOb (§9 and Theorem 2), and 
the V-schema and the 3-schema (§21 and Theorem 15), i.e. all formulas 
of these forms shall be axioms. Moreover, we now permit as the r for the 
V-schema and the 3-schema not merely a variable as in §21 but more 
generally as in § 28 any term r such that, when r is substituted for the free 
occurrences of x in A(x) with result A(r), no occurrence of a variable in 
any of the resulting occurrences of r will be bound. We say such a term r 
is free for x in A(x) (generalizing § 18 from, variables to terms, as in § 28). 
For example, taking x to be a, r to be <ft+e, and A(x) to be 
3c(c' +«—/•)&. -i«=0, the condition is satisfied; but not for the same x 
and r when A(x) is & **l<z=0. 

Therefore, statements of the form “ F B” or “Aj, . . . , A m F B” or 
“Aj, .... A,,, F x *— x « B” (direct rules) which hold for the predicate cal- 
culus now hold for N (cf. § 21). Moreover (as in § 28), in the direct rules of 
Theorem 21 r can be any term in the present sense, and in its Corollaries 
r t , . . . , r„ can be any list of terms not necessarily distinct, in each case 
subject to the freedom condition stated there. 

In addition to the postulates of the predicate cMculus (with the more 
general r for the V-schema and the 3-schema), there shall be one axiom 
schema (13 below) and eight particular axioms (14-21 below). For Axiom 
Schema 13, x is any variable, A(x) is any formula, and A(0), A(x') are the 
results of substituting 0, x' respectively for the free occurrences of x in 
A(x). (These substitutions are automatically free.) 


13. 

A(0) & Vx(A(x) 3 A(x')) ^ A(x). 



14. 

a —ft 3 a = 

15. 

— i«'=0. 

16. 

u—(> 3 (a = c 3 /=c). 

17. 

rr—6 3 a' = ft . 

18. 

a +0=^. 

19. 

a-\- ft = {a-\- f)' 

20. 

^•0=0. 

21. 

4 

+ 

II 

e 


Thus N consists of the predicate calculus plus some "nonlogical axioms” 
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(or “mathematical axioms”), namely eight particular axioms 14-21 and 
N 0 axioms by Axiom Schema 13. 148 

The deduction theorem (Theorem 11, §§ 10, 22) extends to N. For (as 
in § 29), we can handle the new axioms under the old Case 3. Hence in N 
we have the introduction and elimination rules of Theorems 13 and 21 
(5 subsidiary deduction rules based on the deduction theorem, and 13 
direct rules). 

As we remarked in § 37, a formal system formalizing a portion of 
informal mathematics has an “intended” (or “usual” or “standard”) inter- 
pretation. (When we discuss this, or model theory more generally, we are 
going outside of metamathematics.) The informal mathematics that we aim 
to formalize in N is elementary number theory. So for the intended interpre- 
tation, the variables range over the natural numbers (0, 1, 2, . . .}, i.e. this 
set is the domain. The logical symbols 3, &, V, -i, V, 3 are interpreted 
as in Chapters I and II (in classical logic). The function symbol ' is inter- 
preted as expressing the successor function + 1 , and 0 (“zero”), + (“plus”), 

• (“times”) and = (“equals”) have the same meanings as those symbols 
convey in informal mathematics. 149 All the terms in N are names for 
natural numbers, specified or unspecified, just as the formulas express 
propositions. 53 - 105 

The nonlogical axioms play a role under the interpretation like assump- 
tion formulas for the valid consequence relationship (II) of § 20 with all 
their free variables having the generality interpretation. Thus in taking 
Axiom 14 as a permanent assumption, or axiom, for elementary number 
theory, we intend to assume that, for every pair of natural numbers a and 
b, if a+1 = 6+1, then a — b. So Axiom 14 is to be regarded as synony- 
mous with its closure ai S{a =6' ZD a—(). 

The deductive rules of N are in keeping with this interpretation. For, 
by uses of V-introd. in Theorem 21 (with the T empty, so Proviso (B) does 
not inhibit us), the closure of each axiom is provable using the axiom. 
Inversely, if we had taken the closures of the present nonlogical axioms 
as the nonlogical axioms, then the present ones would be provable by 
applications of V-elim. Thus in setting up N, it made no essential differ- 
ence whether the nonlogical axioms were written down open or closed. 

149 Most of the next remarks (down through (A) and (B)> apply similarly to any formal 
system consisting of the predicate calculus, or the predicate calculus with equality, 
plus nonlogical axioms. This includes the predicate calculus with equality itself as 
based on the predicate calculus without equality (§ 29). For systems based on the 
predicate calculus with equality, the “nonlogical axioms” are those added to the 
axioms of the predicate calculus with equality. 

149 We rely on the context to make it clear when 0, +, -, = are being used as formal 
symbols and 5+ <, >, 1 , 2, 3, . . . in abbreviating formal expressions, and when they 
are being used informally. 
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The open axioms (and provable formulas) are simpler to write, and their 
use fits with common mathematical practice. 150 We now naturally ex- 
tend the terminology used with (temporary) assumptions in § 20 to say 
the free variables in the axioms of N, and also in any provable formulas 
(cf. Exercise 23.4), have the “generality interpretation”. (In a deduci- 
bility relationship in N, the free variables of the assumption formulas which 
have conditional interpretation stand for the same natural numbers in the 
conclusion B as in the assumption formulas Aj, .... A„„ i.e. for ones 
satisfying the conditions expressed by A„ . . . , A„„ while the other free 
variables of B have the generality interpretation; cf. Exercise 23.5, where 
Xj, . . . , x a may be taken to include all the free variables of B not free in 
Ai, . . ■ i A m .) 

Using V-introds. and V-elims. as just shown: (A) 1- B/«N, if and only if, 

for some list A! A m of nonlogical axioms of N, VA,, . . . , VA m h B 

in the predicate calculus. For the “only if” part, A lf . . . , A,„ can be the 
(finitely many) nonlogical axioms actually used in some particular proof 
of B. 151 

In logic (Chapters II, III), we did not have in mind a particular domain 
and a particular interpretation of the ions and mesons (except of — = — 
in § 29). Let us see how th e model-theoretic concepts of validity and “valid 
c onsequence” (symbolized by “k ”) which we used there apply now. We 
need the versions at least with functions (§ 28), since in N we have the 
function symbols 0 as mesons. We can use those of § 29 with both 

functions and equality, where the predicate symbol = is given the meaning 
of equality or identity (which it has in the intended interpretation of N). 
For, although we based N proof-theoretically on the predicate calculus 
without equality, the postulates of the latter are all good for the predicate 
calculus with equality. As the name “nonlogical axioms” suggests, those 
axioms of N are not valid (or with the validity notion of § 29, all but 
Axioms 16 and 17 are invalid). So Theorem 12 does not simply extend to 
N to say that “If F E, then 1= E”. Instead, using (A) with Corollary Theorem 
11 (as extended to N), and Theorem 12 and Corollary Theorem 8 (in the 
versions of § 28 , or of § 29): (B) If b B in N, then, for some list A,„ 

150 A closed formula is called a sentence by Tarski (in the Eng. tr. of 1933, in 
Tarski, Mostowski and Robinson 1953, etc.). We are using “sentence” simply for 
the linguistic objects (declarative sentences) in the informal language which are formal- 
ized by formulas, open or closed. 

A proof of B in N using as nonlogical axioms only A,, . . . , A„, is not in general a 
deduction of B from A, A,„ in the predicate calculus with all variables held con- 

stant. For in the latter (§ 21), we are restricted in using the V-rulc and the 3-rule below 
the first use of A,. .... A,„ to variables not occurring free in Ai, .... A,„. In a proof in 
N, where A,, . . . , A„, count as axioms rather than as assumption formulas with their 
variables held constant, this restriction is lifted. Hence the V’s arc necessary in (A). 
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of nonlogical axioms of N, VA l5 . . . , VA m N B. Thus, if V B in N, then 
B is t for every nonempty domain D and every assignment in D which 
gives the value t to the closures of all the nonlogical axioms of N (and 
hence to any finite list VA l5 , . . , VA m of those closures). 

Now what are these domains D and assignments in D? In less technical 
language, what interpretations make all the nonlogical axioms of N true 
under the generality interpretation of their free variables? One of them is 
the intended interpretation of N already described. Whether the nonlogical 
axioms arc also true under some other interpretation than this is a question 
for investigation, the result of which we shall report in Chapter VI (§ 53). 

Now let us consider the significance of each of the nonlogical axioms 
of N under the intended interpretation. 

Axioms 14 and 15 and Axiom Schema 13 formalize the third, fourth 
and fifth of Peano’s list of five axioms for the natural numbers, 1889. 152 
Peano’s first axiom, that 0 is a natural number, and his second axiom, that if 
n is a natural number so is «+l, are taken care of instead by the formation 
rules which make 0 a term and whenever r is a term make (r)' a term, since 
ail the terms in the system are interpreted as expressing natural numbers. 

Axioms 16 and 17 are axioms for equality. We do not postulate the 
reflexive law of equality, because it is deducible from Axioms 16 

and 18 with /, « general by the predicate calculus (as we shall show 
below), and so is provable in N. From a=a with Axiom 16 the symmetric 
and transitive laws can be proved (below). Axiom 17 asserts that the value 
of the successor function ' is determined by the value of its variable. In 
§ 29 we called this the "(open) equality axiom for The two equality 
axioms for + and the two for • are provable, as will be indicated below. 

Axioms 18-19 and 20-21 provide " recursive definitio ns” or “definitions 
by induction” of the functions + (plus or addition) and ■ (times or multi- 
plication). Let us see how they “define” these functions. They are not 
definitions which simply introduce abbreviations for combinations of sym- 
bols already available. Instead, the two equations 18 and 19 enable us, for 
any fixed value of a (e.g. 3), to determine the value of w+/ successively 
for 0, 1,2,... as value of /, thus (using the symbolism informally for the 
moment): 

(A*) 3+0 = 3 [by Axiom 18], 

(A?) 3 + 1 = 3+0' = (3+0)' [by Axiom 19] = 3' [by (A*)] = 4, 

(AD 3+2 = 3+1' = (3+ 1)' [by Axiom 19] = 4' [by (A?)] = 5, 


151 Peano dealt instead with the positive integers. A quite full explanation of these 
axioms is in IM §§ 6, 7 (and in § 8, which overlaps the present § 36). That explanation is 
not essential to the present survey. The role of the fifth Peano axiom (Axiom Schema 13) 
is illustrated in Example 2 below; of the third and fourth, in Exercise 38.5. 
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With equations (AJ) giving the values of for any values a and b of 
a and available now, we can proceed similarly to determine the values 
of <r/> (e.g. with 3 as value of a): 

(Mg) 3 0 = 0 [Ax. 20], 

(MJ) 3-1 = 3-0' = 3-0+3 [Ax. 21] = 0+3 [(Mg)] = 3 [(AJ)], 

(Mi!) 3-2 = 3-1' = 3-1 + 3 [Ax. 21] = 3+3 [(MJ)] = 6 [(A*)], 


Example 1. The following is a (formal) proof in N. Strictly speaking, 
the proof is the sequence of 17 formulas. In addition, numbers have been 
supplied at the left, and explanations at the right. 

1. n—f 3 {a—c 3 fi—c) — Axiom 16. 

2. 0=0 3 (0=0 3 0=0) — Axiom Schema la. 

3. {« = /> 3 (a = c 3 <?=*)} 3 {[0=0 3 (0=0 3 0=0)] 3 
[a,=6 3 (a—c 3 6—c)]} — Axiom Schema la. 

4. [0=0 3 (0=0 3 0=0)] 3 [«=/ 3 (a=c 3 6=c)] - 3-rule, 1, 3. 

5. [0=0 3 (0=0 3 0=0)] 3 Vc[a=tf 3 (a=c 3 t=c)] - V-rule, 4. 

6. [0 = 0 3 (0 = 0 3 0=0)] 3 V^Vc[«=/ 3(«=0 <?=c)] - V-rule, 5. 

7. [0=0 3 (0=0 3 0=0)] 3 V^V4«=/3(«=o 3 t=c)] 

— V-rule, 6. 

8. V«VA^[«=/f 3 (a=c 3 6=c)) - 3-rule, 2, 7. 

9. V*VAfr[*W 3 («=c 3 /=*)] 3 

VA/c[« +0=t? 3 (<r+0=<- 3 6—c)\ — V-schema. 

10. VAM"+0 = ^3 («+0=r 3 <?=«)] - 3-rule, 8, 9. 

11. V/Vf[«+0=/3(®+0=«3/=4]3 

V< [«+0=« 3 (<j+0=£ 3 a=c)] — V-schema. 

12. V4«+0=« 3 (^+0=c 3 a=c)\ — 3-rule, 10, 11. 

13. Vr[// +0 = « 3(«+0=£3 a—c) ] 3 [<i+0= a 3 («+0= <-t 3 <-< = «)] 
— V-schema. 

14. *7+0=//. 3 (a-\-0= a 3 a = a) — 3-rule, 12, 13. 

15. (f + 0—a — Axiom 18. 

16. /7+0=<7 3 a — <t — 3-rule, 15, 14. 

17. a — a — 3-rule, 15, 16. 

Thus a — a (the reflexive law of equality) is a provable formula; or in 
symbols, I- a=a. (Here “F a=a” is not a formula, but a statement in 
our mctamathematical shorthand that the formula u=a is formally 
provable.) 

Now we give informal proofs in the formats (A) and (B,) (cf. §§ 13, 25) 
of the fact that F «■ = « (i.e. that a formal proof of a = a exists). 
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1 . b 3 («.=« 3 ^=.c) — true because the formula concerned is 
Axiom 16. 

2. b /7+0=« 3 (/z +0=<7. 3 a.=a ) — from 1 by substituting /r+0, 

« for /?, « (Theorem 21 Corollary 2 (d) for Y empty). 

3. btf+0 = /r — true because <7+0 = <v, is Axiom 18. 

4. h a— a — from 3 and 2 by two applications of 3-elim. 

Substituting <2+0, a, a for a, (>, c in Axiom 16, 

(B x ) « + 0=« 3 (<t+0=a 3 a=a). From this and Axiom 18 by 3- 

elims., u — a,. 

Repeating in part some points made in Chapters 1 and II (especially 
§§ 10, 13, 25), forma! proofs even of simple formulas tend to be long, 
as Example 1 illustrates. We are interested in what formulas there are 
formal proofs of, i.e. in what formulas are formally provable. We shall 
be satisfied to know whether formal proofs of these formulas exist, and 
we shall not ordinarily care about actually seeing such formal proofs when 
we have shown that they exist. Therefore we find it satisfactory to use 
informal proofs to show that the formal proofs exist, when it is easier to 
do so and when the informal proofs are by “finitary” methods (§§ 36, 37). 1M 
If challenged to find the formal proofs we have demonstrated to exist, we 
are then able to supply them. Continuing in the manner illustrated, our 
methods for informally proving (in metamathematics) the existence of 
formal proofs (now in the system N) can be brought quite close to the 
methods of the ordinary mathematician in developing the elementary 
theory of the natural numbers. However, we must not forget that we are 
doing something somewhat different than he; i.e. we must not lose sight 
of the path by which our informal proofs can be converted into formal 
proofs in N. 

We shall go a bit further in illustrating the beginnings of the develop- 
ment of number theory in N, by giving a few more metamathematical or 
informal proofs that certain formulas expressing theorems of elementary 
number theory have formal proofs in N. 

From the reflexive property of equality b a — a (*100) and Axiom 16, 
we readily obtain the symmetric and transitive properties b n=f- 3 

153 If computing machines are to be applied to seek or check proofs, a further step is 
appropriate. Some selection of the quicker and easier methods to which our meta- 
mathematical investigations have led us, or (for seeking proofs) possibly of other methods 
tailored to the machines strengths and weaknesses, should be stereotyped as a new 
formal system in which the computer is to make its searches or perform its checks. Here 
we arc not referring just to formal number theory, but also to the predicate calculus and 
other systems obtained by adding mathematical axioms to it. Cf. Wang 1960, Davis and 
Putnam 1960, J. A. Robinson 1963, 1965. 
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(*10l) and V «=/■ & fi—c D //=<: (*102). irj4 (If it isn’t obvious, refer to 
Theorem 29 in § 29.) 

Having the reflexive, symmetric and transitive properties of equality, 
we can use chains of equalities analogously to chains of equivalences in 
§ 5, except that we do not yet have the general replacement theorem 
(Theorem 30, analogous to Theorem 5) to use in justifying links of the 
chains. We use chains to simplify the presentation in the next example. 

Exempli: 2. We show that F a—/'- D ft+c~/>+f (*104). (In the termi- 
nology of § 29, this is one of the two open equality axioms for +.) 
Preparatory to using D-introd., assume (a) Preparatory to applying 
Axiom Schema 13 (“mathematical induction”) with /■, et+c—fi+c as the x, 
A(x), we need to deduce (from <?.=/) the two formulas «-f0=/+0 and 
Ve(r/ -f-r — f~\~c D a-\-c —SArc"), I. (BASIS.) ^-p0 “ ^ [Ax. 18] — f' 
[(a)] = /+0 [Ax. 18 with substitution; this gives (cf. Example 

1 (A) Step 2), whence by the symmetric property of equality /f=/?+0]. 
So (using the transitive property of equality, implicit in the chain method) 
//+0=/-P0. II. (Induction step.) Preparatory to D-introd., assume 
(b) « + /—/+£. Then /i+c'=(a+c)' [Ax. 19 with subst.] = (/+c)' [using 
(b) and Ax. 17, with subst. and D-elim.] = 6-\-c [Ax. 19 with subst.]. 
By the planned D-introd. (discharging the assumption (b)), 

(i+r=/ r +c Z3 a+c'—f+c. Thence by V-introd. (since our remaining 
assumption (a) does not contain c free), 'ic(a-\-c—t+c D a+c' =f+c). 
III. By Ax. Sch. 13 with the results of I and II and &-introd. and D-elim., 
^+c=/+r. By D-introd. (discharging (a)), D a+c—fi+c. 

Next we can establish (the provability of) the other equality axiom 
for +, namely *i—6 D c-\-a—c+(' (*105), though this isn’t quite so easy 
(Exercise 38.2). The two for • can be shown to be provable similarly. Then 
we will have all the axioms of the predicate calculus with equality (§ 29) 
which go with the symbolism of N. So by Theorem 30 in § 29, we will have 
the replacement property of equality in general; so chains of equalities 
can be constructed using any replacements based on available equalities. 
(Before we had to be careful to use replacement steps only when we had 
special justification, as by Axiom 17 in II of Example 2.) Now (or earlier, 
after establishing just the equality axioms for +), we can prove the associa- 
tive law for addition («'+/)+c=rt , +(/+r) ( * 1 17; Exercise 38.3), etc. 

The foregoing gives only enough of a beginning to convey an impression 
of how the development of number theory in N would go. A week or two 
of study would bring us quite far in this development. 

By carrying such a development quite far, partly in a straightforward 
way as illustrated, and partly through investigations of a general nature, 

151 We give parenthetically the numbers with which these results appear in IM 
Chapter VIII. 
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it can be said to be established that the system N is adequate for the usual 
elementary number theory such as occurs in standard texts (but not for 
analytic number theory, end § 36). By this we mean that, first, the predicates 
and propositions commonly used in informal elementary number theory 
can be expressed by formulas in N, and, second, for those propositions 
which are commonly proved as theorems in informal number theory, the 
formulas expressing them are formally provable in N (as illustrated by a 
few simple examples just now). 

The first part calls for a little explanation. We have already seen that, 
although <t </ is not provided directly in the formal sy mbolism, a formula 
3 /■(/+«=/) is so provided which under the (intended) interpretation 
expresses a<b. By introducing “<” as a symbol of abbreviation, we can 
thus express inequalities. That a divides b can be expressed by 3/ (*•<•=/), 
which we abbreviate ‘V|/’\ That a is a prime number can be expressed by 
1 O & — j3<-( I <r & r<» & which we abbreviate “Pr(«)”. Euclid’s 
theorem that there are infinitely many primes can be expressed by 
3/(Pr(/)& d</) or Vc/3/(Pr(/) & <v </). The provability of this formula 
is established in IM p. 192 *161, where it appears about 60 results later 
than our Exercise 38.3. (By no means are all intervening results used in 
its proof.) 

Since 0, • are the only function symbols in N, besides which vari- 

ables.are available, no function other than a polynomial can be expressed 
by a term of N. This is undoubtedly a limitation in the symbolism of N, 
but it can be circumvented. For what can be expressed informally using 
functions can in fact be paraphrased using predicates instead. Thus, say 
f(x x , . . . , x„) is a number-theoretic function of n variables, and let 
F(x u . . . , x„, y) be the predicate of /;+ 1 variables which is true for exactly 
those /?+ I-tuples (*,, .... x„,y) such that f(x i, . . . , x„) = y. We call 
F(x x , . . . , x tl , y) the representing predicate of f(x t , . . . , x„). What can be 

stated using /(•«■,, .... x„) can be paraphrased using F(.c 1 x tl , y) 

instead. For example, consider the function xl (where 0! = I and 
(,r+ 1 )! = I • 2 • . . . • (x+ I ) with *+ I factors). Let F(x, //) express x\ = y. 
Take the proposition (x+l)! = x'.(x+\), which is an (informal) theorem 
about the factorial function. This can be paraphrased in terms of the 
predicate F(x, //) e.g. thus: 1, u) & F(x, v) & u—vix+ 1)]. 

Now it turns out that, although only polynomials can be expressed in 

N by means of terms, the representing predicates F(x x n , y) of a 

vastly greater class of functions can be expressed in N. 155 

Furthermore, not only can the propositions using such functions be 
expressed indirectly by using their representing predicates, but all the 

This is a consequence of work of Godel 1931 and Kleene 1936 (cf. IM §§48, 49, 
57, or specifically Theorem VII (b) p. 285). 
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reasoning that could be carried out with the functions can be paralleled 

too . 156 

Thus it comes about that N is adequate for the usual elementary number 
theory despite its obvious shortage of function symbols. 

One might ask: Would it not be better to remedy this deficiency by 
constructing another system with more function symbols? One can do 
this, but for foundational questions it is often best to keep the system as 
simple in its structure as possible, In fact, the theory just cited allows us 
to use such systems richer in functions than N and construe all the results 
in terms of N. 

Another question is whether we could still get the same results with 
even fewer function symbols in the symbolism. Specifically, could we omit 
• as a formal symbol, and omit Axioms 20 and 21, and then get the effect 
of • in the same way that in N we get the effect of x\ etc.? The answer is 
“No ”. 157 

So far, in the case of N, we have used Hilbert’s idea of studying the 
formal system from outside by “finitary” methods (in metamathematics) 
mainly in developing short cuts by which we prove metamathematically 
(informally) that various formulas are formally provable. Hilbert intended 
of course that metamathematics should concern itself also with general 
questions about the formal system, such as those as to its consistency and 
completeness. (We did deal with both of these for the propositional cal- 
culus in §§ 1 1, 12, and with consistency for the predicate calculus in § 23.) 

Ackermann in 1924-5 thought he had proved metamathematically the 
consistency of N. But von Neumann in 1927 pointed out that Ackermann’s 
proof is limited to the subsystem of N in which the use of Axiom Schema 13 
(the induction schema) is restricted to the case of an A(x) which does not 
contain any free occurrence of x in the scope of a quantifier, i.e. in the B 
of a part VyB or 3y B ; at the same time von Neumann gave another meta- 
mathcmatical consistency proof for that subsystem. 

The failure of metamathematicians to establish the consistency of N in 
the next few years was illuminated in 1931 by results of Godel, which we 
will present in a general setting in Chapter V (especially §§ 43, 44). These 
results begin with an answer to the completeness question (in which we 
ask whether N suffices for all elementary number-theory, not just for 
what is usually developed). 

130 The basic step is the elimination of “proper definite descriptions” (§ 31), using a 
theorem of Hilbert and Bcrnays 1934 (cf. pp. 422-457 and 460 ft'.). There is a treatment 
in IM pp. 407-419. The theorem gives directly conditions under which the effect of 

fU j „) can be secured in a system in which we have F{x u . , . , x„, y). It is a proof- 

theoretic version of Theorem 32 § 31. 

157 This follows by combining a result of Presburger 1930 and Theorem IV in §43 
below. (Cf. I M pp, 204, 407.) 
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In the following chapters, N can always refer to the particular formal 
system described in this section (which is the simplest way to read those 
chapters), and sometimes (when we so indicate) N can also refer more 
generally to any system with similar properties. 

Exercises. 38.1. Translate Example 2 (as given in the format (BO) 
into statements using the symbol b (in the format (B 3 )), and check it in 
the latter format (cf. §§ 13, 25). 

38.2*. Show that b a=6 3 c+a—c+6. 

38.3. Assuming the result of Exercise 38.2, and using the general 
method of Example 2, show that b (a+6)+c=a+(6+c). 

38.4. Show that b 3+0 = 3, b 3+1=4, b 3+2=5, ...; i.e. establish 

statements of formal provability corresponding to (Aj[), (AJ), (A|), 

38.5. Show that, using besides predicate calculus only Axioms 14 and 
15: bl^O, b2^0, b 2?^ 1, b 3;^0, b 3^ 1, b 3?*2, . . .. 

* § 39. Some other formal systems. In this section, we give 
further examples [2]— 150] of formal systems (where [l] is the system N of 
§38). 

We now describe [2] a system G, which formalizes the elementary 
theory of an unspecified “group” (explanation follows). 

The formal symbols shall be 

V, — I, V, 3, =, •, 1, •<*, e , . . . , |, (, ). 

Variables are to be constructed as in § 38 for N. (The only difference in 
the formation rules between N .and G is that +, •, ', 0 are replaced by 

•,-M.) 

Definition of “term”. 1. 1 is a term. 2. The variables a, 6, c, . . . are 
terms. 3-4. If r and s are terms, so are (r)-(s) and (r) -1 . 5. The only terms 
are those given by 1-4. 

Given this definition of “term”, the definition of “formula” from 
“term” reads as in § 38. 

Parentheses are omitted under the same conventions as there. Further- 
more (as we could also have done in § 38), we shall abbreviate rs (i.e. 
(r)-(s)) as “rs” (omitting the dot). 

As postulates, we take those of the predicate calculus, with the present 
notions of “term” and “formula” (§§21, 28); just as for N, the r for the 
V-schema and the 3-schema can be any term free for x in A(x). In addition, 
there shall be the following six particular axioms: 

El. « = ZD (a = c Z3 $—c). E2. E3. a,— 6 3 ca=c&. 

Gl. (at?)c= a{fc). (Associative law.) 

G2. a\ =a,. (Right identity.) 

G3. ««“ 1 =1. (Right inverse of a.) 
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We do not intend a single interpretation of this formal system G, as 
we did of N. The system G may be interpreted by any “group” G, as we 
shall explain and illustrate in a moment. For any choice of a “group” G, 
the terms are interpreted as naming members (specified or unspecified) 
of G (or more precisely of G 0 , below). 

Axioms E1-E3 give what we need to postulate of the properties of 
equality (identity). 168 

The axioms G1-G3 for groups will be familiar to many students. A 
group G is briefly any “system” of objects which “satisfies” these three 
axioms (with the variables in the generality interpretation §§ 20, 38, and 
with = expressing "identity” or “equality” § 29). 

That is, a group G consists of a nonempty set G 0 , a 2-place function 
crb with arguments and values in G 0 , a 1-place function a -1 with argu- 
ments and values in G 0 , and a member of G„ (or 0-place function) 1, such 
that the closures of G1-G3 are all t in our model theory of the predicate 
calculus with equality §29 when G 0 is the domain and a-b, a -1 , 1 are the 
values of a-f, or 1 , l. 159 

To give only a few particular interpretations, G can be (l) the positive 
rational numbers with •, 1 having their usual meanings. (The student 

should have no trouble verifying that G1-G3 are satisfied by each of our 
interpretations.) Alternatively, G can be (2) the rational numbers except 
0, (3) the positive real numbers, (4) the real numbers except 0, or (5) the 
complex numbers except 0, with •, -1 , 1 in their usual meanings. Still 
again, G can be (6) all the integers, (7) all the rational numbers, (8) all the 
real numbers, or (9) all the complex numbers, with a-f, a~ x , 1 meaning 
a+b, — a, 0, respectively. 

As an interpretation not by a number system, G can be (10) or (li) all 
the possible rotations of a square within its plane, or through three- 
dimensional space, into itself. That is, we are to rotate the square so that 
after the rotation it occupies the same portion of space as before, but the 
original corners may be in new positions; e.g. if the original square is 
A BCD, then after the rotation A may be where D formerly was, B where A 
formerly was, etc. Also, turning the square by —90° or 270° or 630° about 

158 Alternatively, we could have started from the predicate calculus with equality 
(§ 29), thus automatically postulating five open equality axioms (or their equivalent), 1118 
and have added just the three axioms G1-G3. 

158 Textbooks on (informal) group theory are e.g. Marshall Hall 1959 and Rotman 
19 65. 

The symbols “|” are often chosen differently. Particularly, is often 

written as to suggest a variety of interpretations, in some of which 0 will not be 
the usual multiplication, and "1” as “/” (the “identity”). 

“Group theory” includes nonelementary parts (e.g. subgroups, isomorphism, 
representations), not formalized in G. 
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an axis perpendicular to its center is considered the same rotation; so 
“rotation” really means “result of rotation”. Now a-fi is interpreted to 
mean (the result of) the rotation a followed by the rotation b. The further 
details are left to the student (Exercise 39.1). 

Similarly, G can be ( 12 ) or (t3) all the possible rotations of a circle into 
itself, or (14) all the possible rotations in 3-dimensional space of a cube 
into itself, or ( 15 ) of a sphere into itself. 

Now we demonstrate (informally) the (formal) provability of a number 
of formulas (writing “T” for “theorem”). Proofs not given are exercises. 
As was remarked for N in § 38, the direct rules of the predicate calculus, 
and also all of the introduction and elimination rules of Theorems 13 
and 21, hold good for G. 

Tl. 1- a— a. (Reflexive property of equality.) 

Proof (in the format (BO). By substituting al, a, a for ti, c in 
Axiom El, a\ = a D(tol=to 3 a=a). Thence using G2 and 3-elim. twice, 
tt=a. — The next two follow as did *101 and *102 in § 38. 

T2I h a=6 3 f>=a. (Symmetric property of equality.) 

T3. 1- «=/?& fi—c 3 a—c. (Transitive property of equality.) 

Having now the reflexive, symmetric and transitive properties of 
equality, we can use chains of equalities (though without unrestricted 
replacement yet; cf. § 38). 

T4. t- or 1 a— 1. (Left inverse of a, the same as our right one.) 

Proof (by a chain of equalities). a~ l a = (a~ x a) 1 [G2 (and sub- 
stitution)] = (a~ 1 a)(a~ 1 (a~ 1 )~ 1 ) [G3, E3] = « _1 (<7(«~ 1 ("~ 1 ) -1 )) [Gl] = 
«-•((* trr'X#- 1 )- 1 ) [Gl, E3] = to-'Ui*- 1 )- 1 ) [G3, E2, E3] = 

(*-M X*- 1 )" 1 [Gl] = to~ l (a~ 1 )- 1 [G2, E2] = 1 [G3]. 

T5. h \ti = <r. (Left identity, the same as our right one.) 

Proof. 1 « — {aa~ l )a [G3, E2] = a(a~ x a) [Gl] = a l [T4, E3] = 
« [G2]. 

T6. h = a 3 x=l. (Uniqueness of right identity.) 

Proof. Assume for 3-introd., ax — a,' Now x = \x [T5] = 
{a- x a)x [T4, E2] = a~\ax) [Gl] = a~ l a [by the assumption ux = a 
with E3] = 1 [T4J. 

T7. h xa—ti 3 x=l. (Uniqueness of left identity.) 

T8. I- ax— 1 Ox=a~ 1 . (Uniqueness of right inverse of a.) 

Proof. Assume ax = 1. Now x — lx [T5] = (a~ 1 a)x [T4, E2] = 
a~\ax) [Gl] = a- 1 ! [by the assumption ax— 1 with E3] = or 1 [G2]. 
T9. I- xtc= 1 3 x—a- 1 (Uniqueness of left inverse of a.) 

T10. I- a—fi 3 a~ 1 =^~ 1 . (Equality axiom for _1 .) 

Proof. Assume a = />. Then 6a~ x = aa~ l [by the assumption 
a =fi with E2] = 1 [G3]. So by T8, a~ 1 =#~ 1 . 

Now we have the equality axioms for all our symbols (in Tl, El, E2, 
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E3, T10). So hereafter we can replace on the basis of equalities without 
restriction (Theorem 30 § 29). 

Furthermore, having now both the replacement property of equality 
and the associative law Gl, we shall hereafter write*(rs)t and r(st) as “rst”, 
so applications of Gl become tacit. 

Til. I - ac=fic"D a—fi. (Right cancellation.) 

Proof. Assume ac = fie. Then a = a\ = aec~ l =ficc~ x — fi\ — fi. 
T12. 1- ca=cfi 3 a—fi. (Left cancellation.) 

T13. I - (a~ 1 )~ 1 =a. (Inverse of inverse of a.) 

Proof. (<7. -1 ) -1 = 1 (« -1 ) -1 = e/a-^a- 1 )- 1 = a\ = a. 

T14. I - (afi)~ 1 =fi-~ 1 a~ 1 . (Inverse of a product.) 

Hint for proof: Use T8. 

Next we describe [3] a simpler formal system Gp of group theory. For 
this, we omit the function symbol -1 and the individual symbol 1 from the 
list of formal symbols, correspondingly simplifying the definition of 
“term” and thence of “formula”. We replace G2 aud G3 by the following 
two axioms. 

Gp2. 3 fi ufi=c. (Existence of right quotient.) 

Gp3. 3 a/tfi—c. (Existence of left quotient.) 

That is, the postulates are those of the predicate calculus and the six 
axioms E1-E3, Gl, Gp2, Gp3. This corresponds to defining a group G 
as a system satisfying the axioms Gl, Gp2, Gp3. 

Thl. I -a—a. (Reflexive property of equality.) 

Proof. Substituting a for c in Gp2, 3 fi afi—a. Preparatory to 
3-elim., assume «/=«. Substituting into El, afi=aZ) (erfi=er 3 eo—ei). 
Using 3-elim. twice, a = a. Now we complete the 3-elim. 

Th2. H a—fi 3 fi=a. (Symmetric property of equality.) 

Th3. I- a—fi&c fi—c 3 a=-c. (Transitive property of equality.) 

Th4. F 3 <‘V<7 (Existence of right identity.) 

Proof (using E2, E3, Gl tacitly). Assume for 3-elim. from formulas 
obtained from Gp2 and Gp3 by changes of bound variables (*74 § 24) and 
substitutions: (1) ifi=fi, (2) (3) (j = t, ( 4 ) fin = /, (5) 

(6) fi//=M. (The x of each 3-elim. will be the underlined variable.) Now 

/ = fij [(3)] = firrj [(4)] = fa [(2)] = / [(4)], so by (3): (7) ij=i. Also 
s = 4-m, [(5)] = ifin ((6)] = fin [(1)) = M [(6)], so by (5): (8) By 

(7) and (8), <'=y, so by (2): (9) ai—a. Now we complete five 3-elims., 
discharging successively the assumptions (6)-(2), use V- and 3-introd. to 
get 3 i'ia. ai=/t, and finally discharge (1) by the sixth 3-elim. 

Now we shall sketch why Gp formalizes essentially the same theory as 
G. Since the axioms of Gp (unlike those of G) are symmetric with respect 
to •, without further work we can write down Th5. 
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Th5. 1- 3 fit jt=t. (Existence of left identity.) 

Th6. b Wa at— a & Wt Jt—t 3 i—J. (Equality of right and left identity.) 
Th7. h Wa ai=a& 'iaax=a 3 i—x. (Uniqueness of right identity.) 
(Hint: use Th5, Th6.) For the abbreviation “3!xA(x)’\ cf. §29. 

Th8. h 3<[Ver at— a & 'ixiy/a ax— a 3 #‘=jp)), abbreviated 
h 3!* a . (Unique existence of right identity.) 

(Hint: use Th4, Th7.) The formula ai=a expresses the representing 
predicate 1=» of the right identity 1 as a 0-place function (cf. end § 38), 
and the provable formula of Th8 says that Va ai=a is a representing 
predicate. Now the conditions for the application of the eliminability 
theorem cited in Footnote 156 § 38 are fulfilled. By that theorem, we can 
add the individual symbol 1 to the symbolism of Gp, and the formula 
V« a\ — a, or with the same effect G2, to its axioms, to obtain a system 
Gp t with the following properties. In Gp x , exactly the same formulas not 
containing 1 are provable as in Gp. Each provable formula of Gp x contain- 
ing 1 can be paraphrased, as illustrated at the end of § 38, by a provable 
formula of Gp. 

Now we develop similarly the theory of the inverse a~ x in Gp x . 

Th x 9. Ha-A ThjlO. V^tat—l. Thjll. H3*^=l. 

Th x 12. hat=l &ca=l 3 Th x 13. h at=l & acc—\ 3 t=-'X. 

ThjM. \rl\t at— 1. (Unique existence of right inverse of a .) 

The formula at— 1 expresses the representing predicate a~ l =b of a -1 . By 
a second application of eliminability, we can add to Gp x the function 
symbol ~ l and the axiom G3, obtaining a system Gp 2 . But by Exercise 
39.2, Gp2 and Gp3 are redundant in Gp 2 (being provable in its “sub- 
system” G), and thus they can be omitted as axioms of Gp 2 , doing which 
gives G. 

We conclude hence that in G exactly the same formulas not containing 
-1 or 1 are provable as in Gp, while the provable formulas of G containing 
~ l or 1 can all be paraphrased by provable formulas of Gp. Thus G is in 
the same relation to Gp as the richer systems of number theory alluded to 
at the end of § 38 are to the system N. 

Our examples of groups except (ll), (13), (14) and (15) satisfy an additional 
axiom: 

G4. at=ta. (Commutative law.) 

Such groups are called commutative or Abelian. Formal systems [4] AG and 
[5J AGp for Abelian groups are obtained by adding G4 to the postulates 
of G and Gp, respectively. 

To see that (ll) does not satisfy G4, say the square is in a horizontal 
plane with edges running north and east. Let a be a clockwise rotation of 
90° (as seen from above) about a vertical axis through the center of the 
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square, and b be a rotation of 180° about a horizontal axis running east 
through the center. Then the results ab and ba of performing these two 
rotations in different orders are different, as the student may verify. 
Similarly with (13), (14) and (15). 

Our groups are infinite, i.e. have infinitely many elements in the set (7 0 , 
except (10), (ll) and (14), which are finite. — 

As we remarked in § 37, we get specific formal systems of propositional 
and predicate calculus by supplying suitable definitions of “atom”, 
and of “ion” or of “ion” and meson”. 

We begin with [6] the pure propositional calculus Pp. For this, we 
introduce a new species of formal symbols stf , 38,^ , . . . (script capitals), 
called proposition letters , iao The problem of what to do after the twenty- 
sixth letter can be handled as we did for number theory N, by using a 
formal symbol ( to manufacture additional proposition letters. Most 
treatments simply assume there is a potentially infinite list of proposition 
letters (and likewise of variables for number theory). The other formal 
symbols are 3, &, V, n and parentheses. “Formulas” are defined as in 
§ 1 with the proposition letters now being the atoms (and with parentheses 
handled as in § 38, or in some other suitably explicit manner). If we have 
to talk about both these formulas and those of formal number theory N 
in the same context, the two kinds may be distinguished as proposition 
letter formulas and number-theoretic formulas. The postulates, of course, 
are Axiom Schemata la-lOb and the 3-rule. 

Similarly, we obtain [7] the pure predicate calculus Pd by using as the 
ions sii, s /(— ), , — ), . . . , 38 y 3f(—), J?(— , — ), .... We call 

these predicate letters. w Infinitely many of them are to be available with 
each number n > 0 of open places; we may get these either by assuming 
an infinity of script capital letters as formal symbols, or by using | to 
manufacture others after the twenty-six the printer really has. The other 
formal symbols are ~, 3, &, V, -i, V, 3, variables a, 8, c, . . . (infinitely 
many, or twenty-six with j to form others), the comma and parentheses. 
Thence “formula”, or more specifically “predicate letter formula”, is 
defined as in § 16 with the predicate letters now being the ions (and an 
explicit treatment of parentheses). The postulates are Axiom Schemata 
Ia-lOb, the V- and 3-schema (as originally stated in §21 with just 
variables as the terms), and the 3-, V- and 3-rule. 

““ In systems having particular axioms and a postulated substitution rule (in contrast 
to axiom schemata and a derived substitution rule), 86 .V, 3$, ... are called 

proposition variables. Cf. § 9 just after Example 4. 

In similar treatments of the predicate calculus, what we call “predicate letters” are 
called “predicate variables” (or by some authors “functional variables”); 51 and similarly 
with functions, our “function letters” becoming "function variables”. 
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By adding to the predicate letters as ions also ( — )=( — ) (read “ — 
equals — ”), and to the postulates the open equality axioms for = and for 
each predicate letter with n > 0 open places, we obtain [8] the pure 
predicate calculus with equality Pd = . 

Now let us take as mesons (§ 28) the expressions f, f (— ),/ ( — , — ), . . . , 
y, y( — ), y( — , — ), .... which we call function letters. 160 Here /, y, 
ft, . . . are lower case script letters from the middle of the alphabet, which 
are to be removed from use as variables. (We may either assume an 
infinite supply of these letters as well as of the letters for variables, or 
reserve finitely many for each use and manufacture others using | .) With 
the rest of the formation rules and the postulates taken as in § 28 or § 29, 
we thence obtain [9] the pure predicate calculus with functions Pdf and 
[10] the pure predicate calculus with functions and equality Pdf= . 

In these pure systems, the atoms or ions (or also mesons) are provided 
completely generally, without reference to any particular application of 
logic. 

But also we can have formal systems of propositional calculus and of 
predicate calculus without or with equality by using the formation rules 
of a more complicated or more special system. Thus using “formula” as 
for Pd, Pd = , Pdf, Pdf=, N, G or Gp with just the postulates of the 
propositional calculus (Axiom Schemata la-lOb and the 3-rule), we get 
respective systems [11]-[I7] of propositional calculus. Using “formula” 
(or “term” and “formula”) as for Pd=, Pdf=, N, G or Gp with the 
postulates of the predicate calculus Pd or Pdf, we get systems [i8]-[22] of 
predicate calculus. Using “term” and “formula” as for N, G or Gp with 
the postulates of the predicate calculus with equality Pdf=, we get 
systems [23]-[25] of that. The treatments of propositional calculus, of 
predicate calculus and of predicate calculus with equality in Part I were 
framed to apply to any of these systems and others. 

The systems of logic [151-117], [20H22] and [23]-[25] using the formation 
rules of N, G or Gp are examples of applied systems of logic, as their 
formation rules are chosen with a view to the application of the system to 
a more or less specific subject matter, Thus, an applied predicate calculus 
(illustrated by the “number-theoretic predicate calculus” [20]) has a list 
of s ,> 1 predicate symbols or possibly other ions P 1( . . . , P, (each with 
a specified number p { ^ 0 of open places) and a list of / ^> 0 function 
symbols or possibly other mesons f 1; . . . , f, (each with a specified number 
q ( > 0 of open places). 181 Thence “term” (= “variable” if t = 0) and 
“formula” are defined as above (§ 16, or for t > 0 §28, but with an 

1,1 This covers most uses of the term "applied predicate calculus”. One can also 
consider systems with infinitely many predicate or function symbols, and systems with 
a mixture of these notations and those of the pure systems. 
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explicit treatment of parentheses § 38). For the number-theoretic predicate 
calculus, P„ . . . , P s is (— )=( — ) simply (^ = 1, p x = 2), and f 1( . . . , f, 
is (_)+(-), (-)•(-), 0(/ - 4, q y = q t =-2, q 3 = l,q 4 = 0). In 

an applied predicate calculus, the predicate and function symbols or 
other notations have one or more “intended” or “standard” interpreta- 
tions. (Of course, these can play no part in the metamathematics.) Applied 
systems of logic give a formalization of the logic to be used in a particular 
subject, e.g. number theory, directly in the formalized language of that 
subject. This corresponds closely to how the mathematician or layman 
uses logic. 

By using in place of Axiom Schema 8 the intuitionistic Axiom Schema 
8 1 , we obtain intuitionistic systems [26]-[50] corresponding to the classical 
systems (1 J— [ 25 ]. (Cf. ends §§ 12, 25.) 

Exercises. 39.1. For the group of rotations of a square into itself, 
show that there are 4 or 8 elements according as |he square must remain 
in its plane or not during the motion. What are the interpretations of 1 
and - 1 for this group? 

39.2. Using T1-T5, show that F 3^ and F 3« afi—c. 

39.3. Prove T7, T9, T12, T14. 

39.4. The part of the proof of Th4 ending with (9) demonstrates that 
(1), (2), (3), (4), (5), (6) F(9). Show explicitly the rest of the steps (after 
(9)) in the format (B 3 ), and verify the correctness of each 3-elim. and 
the V-introd. 

39.5. Prove Th6-Th8 and Th 1 9-Th 1 I4. 
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COMPUTABILITY AND DECIDABILITY 


§ 40. Decision and computation procedures. Consider a given 
countably infinite class of mathematical or logical questions, each of 
which calls for a “yes” or “no” answer. 

Is there a method or procedure by which we can answer any question 
of the class in a finite number of steps? 

In more detail, we inquire whether for the given class of questions a 
procedure can be described, or a set of rules or instructions listed, once 
and for all to serve as follows. If {after the procedure has been described) 
we select any question of the class, the procedure will then tell us how to 
perform successive steps, after a finite number of which we will have the 
answer to the question we selected. In performing the steps, we have only 
to follow the instructions mechanically, like robots; no i nsight or ingenuity 
o r invention is required of us. After any step, if we don’t have the answer 
yet, the instructions together with the existing situation will tell us what to 
do next. 162 The instructions will enable us to recognize when the steps 
come to an end, and to read off from the resulting situation the answer 
to the question, “yes” or “no”. 

In particular, since no human performer can utilize more than a finite 
amount of information, the description of the procedure, by a list of rules 
or instructions, must be finite. 

If such a procedure exists, it is called a decision procedure or algorithm 
for the given class of questions. The proble m of discovering a decisio n 
procedure is called th e derisio n problem for thisTciisss. 

For example, there is a decisionprocedure for thtT class of questions 
“Does a divide bV (or “Is a a factor of 6?”) where a and b are any 
positive integers. It consists in performing the ordinary long division of b 
by a and observing whether or not the remainder is 0. 

Likewise, there is an algorithm for determining whether or not an 
algebraic equation 

(1) a 0 x n + a l x n ~ 1 + . . . + a n _-ix + a n = 0 (a 0 ^ 0, n > 0) 

192 In practice such procedures are often described incompletely, so that some in- 
essential choices may be left to us. For example, if several numbers are to be multiplied 
together, it may be left to us in what order we multiply them. 
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with integral coefficients a 0> a lt . . . , a„_ j, a n has a rational root. It 
depends on the theorem that, if such an equation (1) has a rational root 
pjq in lowest terms, then p must be a factor of a n and q a factor of a 0 . So 
only finitely many rational numbers pfq are candidates for roots; and we 
can try them each in turn. 

As a third illustration, there is a decision procedure for deciding 
whether or not, for given integers a, b and c, the equation ax + by + c = 0 
has integral solutions for x and y. For this procedure, which is based on 
Euclid’s "greatest common divisor algorithm,”, we refer to textbooks on 
elementary number theory (e.g. MacDuffee 1954 § 9). 

For a given formal system S, consider the following three general 
questions, i.e. (countably infinite) classes of particular questions: “Is a 
given formal expression a formula?”, “Is a given finite sequence of 
formulas a proof?”, “Is a given formula provable?”. 

As we illustrated in §38, any particular question of the first class can 
be answered by seeking a proper pairing of the parentheses in the expres* 
sion. If one is found, then with the help of it we can attempt to retrace the 
steps by which the expression, if it is a formula, was built up under the 
definitions of “term” and “formula”. In so doing, we shall find out 
whether the expression can or cannot be thus constructed. To answer any 
given question of the second class, we merely consider each formula in 
the given sequence in order, and examine whether it is an axiom or follows 
from earlier formula(s) by one of the rules of inference. The objects which 
must be examined to answer a question of either of these two classes 
are contained as parts of the finite object to which the question 
applies. 

The third class of questions is fundamentally different. To show directly 
from the definition that a formula is provable, one must exhibit a proof 
of it. But the proof, if there is one, need not be made up only of parts of 
the formula itself. One must therefore look elsewhere than within the 
given object to answer the question. The definition of a proof of a given 
formula sets no bound on the length of the proof. To examine all possible 
proofs without bound on their length is not a procedure which leads to the 
answer to the question in finitely many steps in the case the formula is 
not provable. So this third decision problem, i.e. the decision problem for 
provability in the system S, unlike the first two, is not trivial. If there is a 
decision procedure, it is one which is not afforded almost immediately 
by the definition of “provable formula”. Also this decision problem for a 
formal system S is of especial interest. So it is ofte n called the deci sion 
problem for the formal system._ 

For pr ovability in the ^p r opositional calculus , there is the decision 
procedure_found by. Post in 1921 ; i.e,, to determine whether or not V E, it 
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suffices to compute the truth table for E, and see whether or not it has all 
t’s (for, by Theorems 14 and 12, b E if and only if t= E). 

The recognition of the dec ision problems f or formal systems goes back 
to Sch roder 1895. Lowenheim 1915, and Hilbert 19 18. 

It would be of especial import to have a decision procedure for the 
system N of elementary number theory (§ 38). For then the solution to 
many long-unsolved particular problems of elementary number theory 
could be obtained mechanically. This is assuming that only formulas true 
under the interpretation are provable in N. For example, we .could then 
settle the problem of Fermat’s “last theorem”. About 1637 Fermat 
claimed he had a proof that the equation x n + y n = z n has no solution 
in positive integers x, y, z, n for n > 2. No one since has succeeded in 
proving or disproving this proposition. Fermat’s “last theorem” can be 
expressed in N by the formula 

F: &^>0 & 0 & «> 2 & A(&, -ty, y , «/)] 

where A( x, y, y, n) is obtained by paraphrasing x n -f y n = z n to avoid 
the exponent function (cf. end §38). Thus to get A(./', y, y, u), we first 
find by known methods (Footnotes 155 and 156 §38) a formula 
E(u:, ji, /<) expressing x n *s u; then A (^,y, y, ") can be 

3 //3t>3 w[E(&, n, a) & E(y, n, v) & E(y, n, a>) & «+v—<e>]. 

If Fermat’s “last theorem” is false, this fact can be shown by computation 
with a suitable quadruple (x, y, z, n) as counterexample. Corresponding to 
this informal remark, it can be shown that in this case the formula “iF is 
provable in N. We cannot give the details in this brief treatment; it comes 
under the claim we made in § 38 that N is adequate for the usual elemen- 
tary number theory. If Fermat’s “last theorem” is true, then, by the 
supposition we made here that in N only true formulas are provable, “iF 
is unprovable. So a decision procedure for provability in N would enable 
us to decide by a finite number of mechanical steps whether Fermat’s last 
theorem is true or false, by applying the procedure to determine whether 
-iF is unprovable or provable, respectively. 163 

1K> This would constitute at least a significant theoretical gain over the present situa- 
tion (1967), in which no sequence of mechanical steps is known to lead after finitely 
many steps to the answer to the question whether Fermat’s “last theorem” is true or 
false. 

Practically, the answer to the question might still be beyond our reach by the means 
described. For, in applying the given decision procedure for the system N to the formula 
— iF, more space and time than we have available might be required to carry out the 
finitely many steps that lead (o the decision. 

For what we do in this book, it is not essential that we concern ourselves with this 
matter of whether a given decision procedure for a class of questions is or is not practical 
to use in answering certain questions of the class. (That falls under "computer sciences”.) 
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The efforts expended ove r centurie s i n searching for solu tions to this 
and other famous problems ofji umber theory make it implausible that a 
decision procedure for N should exist, it might have seemed equally 
implausible in 1918 that mathematics could find a way to prove th at 
there ca n be no decision procedure fo r N. But exactly this was done by 
Churc h in 1936 on the basis of a thesis of his which we shall propoun d 
in § 41 

We begin by observing that, just as we may have a decision procedure 
or algorithm for a countably infinite class of questions each calling for a 
‘"yes” or “no” answer, we may have a computation procedure or algorithm 
for a countably infinite class of questions which require as answer the ex- 
hibiting of some object. 

For example, there is a computation procedure for the class of 
questions “What is the sum of two natural pumbers a and bV'. We 
learned this procedure in elementary school when we learned to add. The 
long division process constitutes an algorithm for the class of questions 
“For given positive integers a and b, what are the natural numbers q 
(the quotient) and r (the remainder) such that a = bq + r and r < bT\ 
Thirdly, there is the algorithm which bears Euclid’s name for the class of 
questions, “What is the greatest common divisor of two positive integers 
a and bT\ There is also an algorithm for the class of questions “Given a 
formula E in the propositional calculus and a list P 1( . . . , P„ of distinct 
atoms including all occurring in E, what is the truth table for E entered 
from these atoms?”. The last three of these algorithms for what-questions 
are incorporated into three of the algorithms mentioned above for yes- 
or-no-questions. 

The problem of finding a computation procedure or algorithm for a 
class of what-questions is the computation problem for the class of 
questions. 

We have chosen her e to formulate the idea of decision problems and 
compjimi tiom qai^blems-ojihLJ hr cquntablv infinite classes of question s 
(yes-or-no-questions or what-questions, respectively). 

For^a fi nite class of qu estions, the decision or computation problem 

(analogously formulated) i§ trivial fr om the classical standpoint. For 

(theoretically at least), it could be solved simply by preparing a list of the 
answers to all the questions of the class. 

“What is the shortest distance by major highways between any two of 
the principal cities of the United States?” Assuming agreement on what 
are major highways and which are the principal cities, one can find the 
answer to any question of the class from the tabulations given on some 
highway maps. 



§40 


DECISION AND COMPUTATION PROCEDURES 


227 


“Given any permissible position in chess, can white win (irrespective 
of black’s responses)?” It is notorious that in this case, although a finite 
list of the answers for all positions exists theoretically, for practical 
purposes it is unavailable. If it were, the fun in chess would be lost. 

“Is a given one of a certain finite set of propositions {A, B, C, D, E} 
true?” Here A, B, C, D, E are supposed to be five fixed propositions, so 
there are only five questions in our class. An algorithm would be provided 
by the right list of five yes’s and no’s. This would be short and simple. 
However, if A is Fermat’s “last theorem”, no one to date can provide the 
algorithm. O nly a classical mathematician considers it established that the 
algorithm ex ists. An intuitionist would consider the questio n whether 
the algorith m exists to be open until someone has solved the probl em * 
of FermatV^last theorem”. (Cf. § 36.) ~ 

The results to be developed in the rest of this chapter are independent's, 
of such differences of opinion between classical and intuitionistic mathe- / 
maticians on when an algorithm exists. ' 

Returning to our case of algorithms fo r countab l y infinite classes o f 
quest ions , t hey would likewise always exist trivially from a classic al 
standpoint, if we allowed the (descriptions of) algorithms to be infin ite 
objects; we could ag ain simply list all the answers . But we said an algorithm 
Fas to be a method or procedure or set of rules which can be used, and which 
therefore must be finitely described. A. finite set of instructions must he 
provided that will suffice to guide one to the answer of any one of an 
infinite set of question s. 

A discussion of algorithms for uncountablv infinite classes of q uestions 
is outside the scope of this book. 

WRerT we have a countably infinite class of questions, the differep t 
questions of the class will usually arise b y giving different values (“ar gu- 
ments^ t o one or several variables or “parameters” in the general 


■il 




statem ent of the (class of) questions . In several examples above, a and b 
(or a, b and c) play the role of these variables. In our second example of a 
decision procedure, a 0 , . . . , a n are such variables but their number n 
also varies. 

Because our class of questions is always (outside of the digression 
above) to be countably infinite, we can always enumerate the questions, 
say as Q 0 , Q u Q 2 , . . . , Q a , . . .. Then a can be the variable or parameter. 
In the case of yes-or-no-questions, by putting P(a) = {the answer to Q a 
is “yes”}, the infinite class of yes-or-no-questions becomes a o ne-place 
number-theor etic predicate P(a). 

Suppose instead the questions are what-questions. We now further 
assume that the objects to be produced in answering the questions come 
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from a countably infinite class, which we can enumerate as y 0 , y x , 
V 2 , •••,%>• • •• By putting f{a) = b if the answer to Q n is y b , the infinite 
class of what-questions becomes a one-place number-theoretic function 
/(«)• 

jv So via an enumeration, each countably infinite class of yes-or-no 
questions we are to consider can be reduced to the form “Is the value of 
the number-theoretic predicate P{a) for a as argument true?”; of what- 
questions, to “What is the value b of the number-theoretic function f(a) 
for a as argument?”. 

In cases when the class of questions is already given by using a fixed 
number of variables with suitable ranges, it is often more convenient to 
render it directly by a number-theoretic predicate or function. For 
example, “Does a divide bT' can be rendered by a predicate P(a, b), 
which indeed is commonly written “ a\b ”. To standardize in the following 
treatment, we take our variables a, b, c, . . . ,x, y, z to range over the 
natural numbers. For the present example, we can extend the “divides” 
notion to include 0 (a|0 is true for every a; 0|6 is false except for b = 0). 
“What is the sum of a and bl” is obviously rendered by the function 
a -\-b. 

Inversely, starting with any number-theoretic predicate P(a lt . . . , a n ), 
we have the countably infinite class of questions “For given a u ... , a„, 
is F(n,, . . . , a „ ) true?”. Starting with any number-theoretic function 
f(a u . . . , a„), we have the questions “For given a ly . . . , a n , what is the 
value of /(«!, . . . , «„)?”. 

Thus it comes to the same thing whether we talk about countably 
infinite classes of questions or about number-theoretic predicates or 
functions. If there is a decision procedure for a predicate (or for the class 
of questions arising from it), we call the predicate (or class of questions) 
decidable. Similarly, if there is a computation procedure fora function, we 
call the function computable. 

Furthermore, the case of predicates can be reduced to the case of 
functions by defining the representing function f(a x , . . . , a„) of a predicate 
P{a x a„), thus: 


/(«!, . . . , a „ ) 


0 if P(a t> . . . , a n ) is true, 

1 if P(a x a n ) is false. 


In the one-variable case, /(a) is what in §33 we called the “representing 
function” of the set of a’s for which P(a) is true (in symbols, the set 
dP(a)). It comes to the same thing to compute the value of f(a u . . . , a „ ) 
and observe whether it is 0 or 1 as to decide whether P(a u . . . , a n ) is true 
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or false. In other words, a d ecision procedure can he hand led as a 
computation procedure by taking 0 for “yes” and 1 for “no”. 164 

To emphasize the idea of an algorithm by contrast, we consider some 
more cases in which it is not clear that there is one. We have already 
mentioned that there is an algorithm for “Does ax + by + c — 0 have a 
solution in integers?”, or written as a predicate (Ex)(Ey)[ax + by + c = 0], 
Here we are using “(Ex)" and “( Ey )” in the same meaning as 3a; and 3 y 
in Chapter II; but we prefer henceforth to keep the logical symbolism we 
use outside a given formal system distinct from the symbolism we use 
inside, as far as is feasible. 165 The existence of this algorithm is not 
obvious from the start (unlike those for a\b and a-\-b). Some theory due 
to Euclid is used in setting it up. 

/~Let us generalize this example to consider, instead of the first-degree 
je quation ax -f- by + c = 0, the second-degree equation 
fax 2 + bxy + cy 2 + dx + ey -f / = 0 ( a , b, c not all 0) or even any 
i I alg ebraic equation (polynomial equation) of any degree n > 0 in any 
I number m > 0 of variables ay, ... , x m , and make our class of questions 
I “Does any given algebraic equation with integral coefficients have a 
j solution for its variables in integers?”. Th e decision. problem for th is class 
/ of questions is “Hilbert’s tenth problem”, included in his famous list of 
1 23 “future” problems of mathematics 1900a. It has not been solved. 

To make matters as simple as possible, consider some 2-place predicate 
P(a, x) for which we do have an algorithm. It will not directly follow that 
there is an algorithm for ( Ex)P(a , x). The only procedure the definition 
of this predicate suggests directly is, after choosing a value of a, to start 
trying P(a, 0), P(a, 1), P(a, 2), . . . in the hope of finding one which is true. 
We can’t be sure we will by this procedure find out in a finite number of 
steps whether (Ex)P(a, x) is true or not. For (as we already noted for 
3 xP(x) in § 36), if after any finite number of steps we haven’t already 
found an a; which makes P(a, x) true, we won’t know whether it’s because 

164 Since we are concerned here only with the questions whether the propositions 
taken as values of the predicates are true or false, the predicates in this theory are treated 
extensionally, i.e. not distinguished from the corresponding logical functions, which 
become their representing functions on changing t, f to 0, l. 92 

Below we emphasize that sometimes the definition of a predicate or function does not 
directly give an algorithm for it, but an algorithm may be afforded by some theory 
about the function or predicate. For predicates, if we restore the intensional concept of 
them, we could then say that an (int ensiona l) predicate P may be undecidable while an 

equivalent REg dicate P, is decidable^ 1 * g:::s ^ ' “ 

16a Beginning with this chapter (and in a few instances before) 23 we use the following 
informal logical symbolism: = (“equivalent”), — (“implies”), & (“and”), V (“or”), 
(thus A; “not”), (x) (“for all x”), (Ex) (“(there) exists (an) x (such that)”). 
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( Ex)P(a , x) is false for that a, or because we haven’t gone far enough in 
our search. 

To give this example in terms of number-theoretic functions rather 
than predicates (i.e. propositional functions), suppose we have a com- 
putation procedure for f(a, x). We do not at once know whether there is 
one for the function 

0 if ( Ex')/ {a , x)=0, 

/(«)= . . . 

1 otherwise. 

If f(a, x) is the representing function of P(a, x), then f(a) is the representing 
function of (Ex)P(a, x). 

The recognition that for some (countably infinite) classes of mathe- 
matical questions we have algorithms, and that for others we at least 
don’t know any, goes far back in mathematical history. The Greeks, 
including Euclid, sought algorithms; and the term “algorithm” is derived 
from the name of the ninth century Arabian arithmetician al-Khu- 
warizmi. 



can all be red uced to the computation problems for number-theoretic 
functions. 


So to adva nce toward our objective (Church’s theorem ), we have to 
deal with the question: For wha L number-theoretic functions a re_there 
comput ation procedures (or algorithms)? Briefly: What^ is^the class j>f 
“ c o m p u t old el^Jjmction s ? 

What we have to go on in considering this question is a somewhat 
vague intuitive idea of what constitutes a computation procedure. A 
computation procedure may consist in essentially a direct application 
of the definition of the function, or it may consist in something consider- 
ably different which mathematical theory shows must lead to the same 
function values as the original definition requires. 

Although our intuitive notion of a computation procedure is vague, 
it is nevertheless real, as is shown by two circumstances. First, it does not 
leave mathematicians in any doubt or disagreement that they do have 
computation procedures for many particular functions, e.g. a+ 1, a+b, 
a-b, a b , a !, ma \{a, b) (= the maximum of a and b, i.e. the greater if 
a b , the common value if a — b), min(o, b) (= the minimum of a 
and b), a— I (= a— 1 if a > 1, 0 if a = 0), a—b (= a—b if a ^ b, 0 if 
a < b), [a/b\ (= the quotient when a is divided by b if b 0 , 0 if b — 0 ), 
[Va] (= the greatest natural number whose square is a), [e“] (= the 
greatest natural number < e“), etc. Second, there is likewise no doubt, in 
other particular cases, that the definition of a function or a given equiva- 
lent of its definition does not directly give a computation procedure. For 
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example, we agree that the definition of f(a) displayed above does not 
give a computation procedure for it. 

We are only recapitulating what has not been disputed among mathe- 
maticians for over two millenia (if we correctly understand mathematical 
history). 

This intuitive notion of a computation procedure, which is real enough 
to separate many cases where we know we do have a computation 
procedure before us from many others where we know we don’t have 
one before us, is vague when we try to extract from it a picture of the 
totality of all possible computable functions. And we must have such a 
picture, in exact terms, before we can hope to prove that there is no 
computation procedure at all for a certain function, or briefly to prove 
that a certain function is uncomputable. Something more is needed for 
this. 

Most mathematical logicians agree that it was found in 1935 (and 
published in 1936), as we shall see in the next section. — 

Hereafter, we may say that something can be done “effectively”, or 
tnat an operation or process is “ effectiv e”, as a brief way of saying that 
there is an algorithm for it (i.e. a decision or computation procedure). 

Exercises. 40.1. For the following - predicates’ and” functions, or 
classes of questions, do we have an algorithm or do we lack such (at least 
without more knowledge)? For (a) and (b), assume P{a, x ) decidable. 
..... (the least x such that P(a, x) if ( Ex)P(a , x), 

( a ) f( a ) — jo otherwise. 

... .. ,. /the least x <, b such that P(a, x) if (Ex)[x <; b & P{a , *)], 

(b) /^ 6 ) = (6+1 otherwise. 

(c) Is a a prime number? 

(d) What is the nth prime number? (Assume Euclid’s theorem that there 
are infinitely many primes, § 38.) 

|a+l if Fermat’s “last theorem” is true, 

(e) J (a) — jf. p erma p s “| as t theorem” is false. 

(f) For given formulas A u . . . , A,„, B in the propositional calculus, does 
A^ . . . , A m V B hold in the propositional calculus? 

(g) For given formulas A„ . . . , A„„ B in the predicate calculus, does 
A„ . . . , A m 1- B hold in the predicate calculus? 

(h) Given formulas A x , . . . , A m in the predicate calculus, is a given 
finite sequence of formulas a deduction from A lt . . . , A,„ in the predicate 
calculus holding all variables constant? 

(i) For a given finite domain D and a given formula E in the predicate 
calculus, does D-¥ E hold? (Cf. § 17). 

(j) For a given formula E in the predicate calculus, does 1= E hold? 

(k) (For calculus students.) Does a given elementary function (say one 
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composed from rational numbers and the variable x using finitely many 
times addition, multiplication, division, powers, roots, and trigonometric 
and exponential functions and their inverses) have an indefinite integral 
which is elementary? 

(1) (For algebra students.) Does a given system of m linear equations in n 
unknowns with integral coefficients have'a solution? 

40.2. Show that [ V a ] is computable. 

40.3*. (For calculus students.) Show that [e a ] is computable. (Use the 
result of Hermite in 1873 that e is transcendental; cf. § 33.) 

§ 41. Turing machines, Church’s thesis. The situation in J935 
was that a certain exactly defined class of computable number-theoretic 
functions considered by Church and Kleene during 1932-35, called the 
“/-definable functions”, had been found to have properties strongly 
suggesting that it might embrace all functions which can be regarded as 
computable under our vague intuitive notion. This result was somewhat 
unexpected, since initially it was not clear that the class contained even 
the particular computable function a— 1 mentioned above, and a proof in 
1932 (publ. 1935) that it did was the present author’s first piece of mathe- 
matical research. Another class of computable functions, called the 
“general recursive fu nctio ns”, defined by Godel in 1934 building on a 
suggestion of Herbrand, had similar properties. It was proved by CJiurch 
1936 and KJeene 1936a that the two classes are the same, i.e. e ach 2- 
definable functio n is general recurs ive and vice versa. 

"Under these circumstances Church proposed the thesis (published in 
^1936) that all functions which intuitively we can regard as computable, 
or in__his w ords “effectively calculable”, are~ / -definable, oFl Tquiva lently 
general recursive. This is a thesis rather than a theorem, in as much as it 
proposes to identify a somewhat vague intuitive concept with a concept 
phrased in exact mathematical terms, and thus is not susceptible of proof. 
But very strong evidence was adduced by Church, and subsequently by 
others, in support of the thesis. 

A little later but independently, Turing’s paper 1936-7 appeared in which 
another exactly defined class of intuitively computable functions, which 
we shall call the “Turing computabl e functions”, was introduced, and the 
same claim was made for this class; this claim we call Turing's thesis. 
It was shortly shown by Turing 1937 that his computable functions are the 
same as the 2-definable functions, and hence the same as the general 
recursive functions. So Turing’s and Church’s theses are equivalent. We 
shall usually refer to them both as Church's thesis , or in connection with 
that one of its three versions which deals with “Turing machines” as the 
Church-Turing thesis. Post in 1936, independently of Turing, published 
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rather briefly a formulation fundamentally the same as Turing’s. In 1943 
he published a fourth equivalent, using ideas from unpublished work of 
his in 1920-22. 166 Still another equivalent formulation is provided by 
Markov’s theory of algorithms 1951c. 

Turing’s machine concept arises from a direct effort to analyze com- 
putation procedures as we know them intuitively into elementary 
operations. Turing argued that repetitions of his elementary operations 
would suffice for any possible computation. For this reason, Turing com- 
putability suggests the thesis more immediately than the other equivalent 
notions, and so we choose it for our exposition. 

Turing described a kind of theoretical computing machine. This differs 
in two respects from a human computer working under preassigned 
instructions or an actual digital computing machine (such as a desk 
calculator, or a high-speed computer with electronic tubes or transistors). 
These are respects in which we idealize by divesting the human computers 
and the physical machines of their practical limitations. 

First, a “Turiqg mac hine” is not liable to errors ; i.e. it obeys the intended 
laws for its action without any deviations. 

Second, a “Turing machine” is given a potentially infinite memory . 
That is, although the amount of information stored at any one time is 
finite, there is no upper bound on this amount. Information to be stored 
may include (at one time or another) the statement of the particular ques- 
tion given the machine, the scratch work performed by the machine 
while arriving at the answer, and the answer. To allow for unlimited 
storage of such information, we consider as separate the machine proper 
and a peripheral storage facility, which we will take to be an i nfinite “ tape”. 

The machine proper, which does the computing and thus determines 
what function is computed, admits only a fixed finite number of possible 
“states”. It represents the finite list of rules or finite description of a 
procedure in our intuitive notion of an algorithm, § 40. (Also information, 
but only up to a fixed amount, can be stored momentarily by the machine’s 
assuming one or another of its “states”.) 

Now we formulate our notion of a Turing machine in detail. We number 
moments of time for the operation of the machine as 0, 1 , 2, . . .. At 
any given moment, the machine shall be in one of k + 1 states, which 
we number 0, 1, . . . , k. State 0 we call the passive state', the others, 
active states. A linear tape ruled in squares passes through the machine 
(when it is set up for operation). The tape is potentially infinite to the 
right. Each square is either blank s 0 or has printed on it one of a given 
finite list of symbols s x , . . . , s, ; so s 0 , . . . , s y are the possible square 

166 An account (from 1941) of this work of Post in the 1920’s was published posthu- 
mously in Davis 1965 pp. 338-433. 
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conditions. However, only a finite number of squares are printed at any 
moment in the use of the machine. At each moment, beginning with 
Moment 0, one square of the tape is scanned by the machine. 

Now consider any moment when the machine is in one of its active 
states k. Between this moment and the next, the machine performs 

an act consisting of a sequence of three operations (a), (b), (c), each 
operation coming from the respective category as'follows: (a) print one 

of the symbols S! s, on the scanned square (supposed blank at the 

given moment), or erase the scanned square (supposed printed at the 
given moment), or erase and print one of s lf . . . , s 3 - on the scanned square 
(supposed printed at the given moment), or make no change in the scanned 
square; (b) move the tape so as at the next moment to scan the square 
next to the left of the scanned square (briefly, move left), or leave the tape 
unmoved (briefly, stay centered), or move the tape so as at the next 
moment to scan the square next to the right of the scanned square (briefly, 
move right); (c) change to another state, or remain in the same state. 
What act (within these possibilities) is performed, between a given moment 
at which the machine state is active (one of 1, .... Ac) and the next moment, 
is determined by the machine state and the scanned-square condition 
(one of s„, . . . , Sj) at the given moment, with one exception to be 
explained presently. We call the machine state together with the scanned- 
square condition at a given moment the configuration. In contrast, the 
machine state together with the specification of the scanned square and 
the entire printing on the tape we call the ( machine vs. tape) situation. 

The exceptional case in which the configuration at the given moment 
does not determine the act is that the configuration would call for a 
motion left when the scanned square is already the leftmost square of the 
tape. Then the (b) and (c) parts of the act are altered to: stay centered, 
and assume the passive state (briefly, stop). The machine “jams”. We 
could avoid this exception by assuming a 2-way infinite tape. 167 

If at a given moment the machine is in the passive state 0, no act is 
performed between this and the next moment, i.e. the machine does not 
print or erase, does not move, and does not change from state 0. 

We shall give an illustration of the operation of a Turing machine 
presently. However, let us first define how such a machine is to be used to 
compute _a number-theoreti c function . (Turing used his machines primarily 
to carry out continuing computations of decimal fractions for real 

167 We begin thus in TM p. 357; but the results are essentially the same.' 08170 

Our discussions of Turing machines (beginning with one in our seminar on the founda- 
tions of mathematics at Wisconsin in 1941) follow Turing 1936-7 in the general 
conception of the behavior of the machines, but not in the detailed formulation and 
development. Cf. IM top p. 361, 
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numbers.) For this purpose, we must agree how the argument(s) (i.e. 
value(s) of the independent variable(s)) are to be represented on the tape, 
and how the machine is to give us the resulting value of the function. 
We shall make the supposition that all machines to be considered have 
among their symbols the tally mark say it is s x . We shall represent 
natural numbers by sequences of tallies, “|” for 0, “||” for 1, “|||” for 
2, . . .. To set up the machine and tape to compute for a given argument 
a, we shall arrange that: at the moment 0 the system consisting of machine 
and tape is started off so that the leftmost square of the tape is blank, a 
is represented by tallies on the next a+1 squares, all squares to the 
right of these are blank, the machine is scanning the rightmost printed 
square, and is in its first active state 1. In this situation, we say the machine 
is applied to a as argument. We say the machine computes a value c for a as 
argument, if, starting from this situation at Moment 0, the machine at 
some later moment assumes the passive state 0 (“stops”) with a blank and 
c+1 tallies printed on the tape after the a + 1 tallies representing the 
argument a, the tape being otherwise blank, and the rightmost printed 
square being again the scanned square. 168 

A given machine may compute a value for each natural number a as 
argument, or for some a's but for others, or for no a' s. If, for each a, it 
computes a value c where c = /(a), we say that the machine computes the 
function /(a), and that f(a) is Turing computable. 

Similarly, for functions of more than one variable. 

For example, a machine is applied to 1 as argument, if at Moment 0 it 
is in the following situation 


where the “1” written over the third square shows that this is the scanned 
square and the machine is in state 1, and all squares to the right of those 
shown are blank. A machine computes the value 2 for 1 as argument if, 
having been stated at Moment 0 in the situation above, it will at some 
later moment x reach the situation 

0 


where again all squares to the right of those shown are blank. If in 
similar fashion, when the machine is started with any n+1 tallies on the 

1,8 In this case, the machine cannot during the computation attempt a motion left- 
ward from the leftmost tape square; for, in that case State 0 would be assumed without 
the last of two successions of tallies separated by a blank being scanned. 
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tape, it will eventually stop with these followed by a blank and ci+2 
tallies (the last of them scanned), the machine computes the function 
f(a) — r/+ 1 ; the above illustrates this for a = 1. 

Now we shall describe a machine £ which does compute this function 
f(a) = a + 1 (the successor function), and we shall follow it through its 
acts in computing for the argument a — I. The machine will have only 
tiie one symbol “j”. To save drawing a picture of the tape each time, we 
shall use sequences of numerals “0” and “1” where a “0” stands for a 
blank square and a "I” for a square printed with tally. Thus in the 
illustration above we would write the initial situation 


0 1 l 1 00 0000..., 

and the situation at the moment the computation is completed 
0 1 1 0 1 1 1 ° 0 0 


To describe a machine, we need merely tell for each of its active states 
1, . . . ,k and each of the y'+l scanned-square conditions (blank s 0 or 
printed with Sj, . . . , s,) what act it is to perform. For the machine S we 
describe now, there are to be 11 active states, and 2 conditions of the 
scanned square, namely blank and printed with “|” (or as we write them 
more conveniently, “0” and “1”). The acts it is to perform for each of the 
configurations of these 1 1 active states and 2 scanned-square conditions 
can be shown by the machine table at the left below (top p. 237). 
In the table “P” means “print”, means “erase”; “L”, “C”, “R” 
mean “left”, “center” (i.e. don’t move), “right”; and the number at the 
end of each table entry is the state the machine is to assume at the 
succeeding moment. 

At the right, we follow out the acts of £ in computing a + 1 for the 
argument a = 1. At Moment 0, in the sample computation at the right, 
we see that a printed square (i.e. a 1) is scanned in state 1 ; so we enter the 
machine table in the first row and second column finding “£2”, i.e. go 
right and assume State 2. The resulting situation is shown in the sample 
computation opposite Moment I. Now we have a blank square scanned in 
State 2, so we enter the table in the second row and first column, finding 
“/?3”, i.e. go right and assume State 3. The result is shown opposite 
Moment 2. Now the scanned square is still blank but the state is 3, so by 
the table (third row, first column, where it reads ‘TL4”), the machine 
prints a “|”, goes left, and assumes State 4, with result shown at Moment 3. 
Continuing, we find that at Moment 23 the machine has computed the 
desired value 2 (= a + 1 for a = 1) shown by three tallies. To see that the 
machine computes f(a) = a+1, we must convince ourselves that it will 
compute the value a+ 1 for every value of a as argument. We have done 
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Table for Machine S which 

Computation by Machine S 


computes f(a) 

= a + 1 



for a = 

= 1 




Machine 

Scanned square 




Machine 

vs. tape 


state 

condition 

Moment 



situation 




0 

1 

0 

0 

1 

l 1 

0 

0 

0 

0 




1 

0 

1 

1 

0 2 

0 

0 

0 

1 

CO 

R2 

2 

0 

1 

1 

0 

0 3 

0 

0 

2 

R3 

R9 

3 

0 

1 

1 

0 4 

1 

0 

0 

3 

PL4 

R3 

4 

0 

1 

l 5 

0 

1 

0 

0 

4 

L5 

L4 

5 

0 

l 6 

1 

0 

1 

0 

0 

5 

L5 

L6 

6 

0 

1 

F 

0 

1 

0 

0 

6 

R2 

R1 

7 

0 

1 

0 

0 7 

1 

0 

0 

7 

RS 

ER1 

8 

0 

1 

0 

0 

l 8 

0 

0 

8 

7?8 

R3 

9 

0 

1 

0 

0 

1 

0 3 

0 

9 

PR9 

L10 

10 

0 

1 

0 

0 

l 4 

1 

0 

10 

CO 

ERl 1 

11 

0 

1 

0 

0 4 

1 

1 

0 

11 

PCO 

7?1 1 

12 

0 

1 

0 5 

0 

1 

1 

0 




13 

0 

P 

0 

0 

1 

1 

0 




14 

0° 

1 

0 

0 

1 

1 

0 




15 

0 

P 

0 

0 

1 

1 

0 




16 

0 

1 

0» 

0 

1 

1 

0 




17 

0 

1 

1 

0 9 

1 

1 

0 




18 

0 

1 

1 

1 

p 

1 

0 




19 

0 

1 

1 

J10 

1 

1 

0 




20 

0 

1 

1 

0 

l u 

1 

0 




21 

0 

1 

1 

0 

1 

l 11 

0 




22 

0 

1 

1 

0 

1 

1 

0 U 




23 

0 

1 

1 

0 

1 

1 

1 ° 




24 

0 

1 

1 

0 

1 

1 

1 ° 


this only for a = 1 ; but the reader should not find it too hard to get the 
“hang” of how this machine operates to see that it does so for every a. 

To illustrate computation of a function of 2 variables, a machine to 
compute /(a, b) = a-\-b, when started (for a = 3, b = 1) in the situation 


o 



should later stop (i.e. assume State 0) in the situation 
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The machine that computes a+1 is sufficiently complicated so the 
reader may well wonder how to find machines to compute complicated 
effectively calculable functions. Of course we arc only interested here in 
the theoretical possibility of finding a machine to compute any given 
effectively calculable function, and not in whether the machine operates 
efficiently. The business of finding machines can be systematized by 
starting from the theory of recursive functions.™ 9 This theory deals with 
recursive definitions of functions, such as 

fa+O = a, fa-0 =0, (a 0 ~ 1, 

(a+Z>' = ( a+b )', \a-b’ = a-b+a, a h = a h a. 

How these definitions define the functions was illustrated in § 38. The 
functions commonly used in number theory are definable by use of such 
recursions, and proceeding from the recursive definitions one can in a 
systematic way find corresponding Turing machines, after first setting up 
Turing machines for such simple operations as filling in with tallies all 
but the rightmost of a sequence of blank squares preceded and followed 
by a tally, copying a sequence of tallies, etc. 

In giving the arguments to the machines and receiving the values, we 
have used only one symbol “|” besides the blank (i.e. two square conditions 
“0” and “1”), and likewise in all the action of Machine S. The definition 
of “Turing machine” allows more symbols. To write a machine table for 
a machine with j symbols Sj, . . . , s y , there will be j + 1 columns. In the 
case / > 1, “ P ” is ambiguous; then in place of “P” or or the absence 
of either we can write i in decimal notation to indicate that the square 
condition at the next moment is to be s* (0 < i <,j). 

While we are thus leaving open the possibility of using j+l >2 
square conditions instead of only 2, the fact is that we get no larger class 
of computable functions thereby. 170 

It may seem a little anomalous that, after we have argued that any 
intuitive computation should be performable using only such operations 
as a Turing machine admits, we should find it quite an exercise to see that 
a simple function like a+1 is Turing computable. However we have 

169 E.g. IM Part III. The three definitions shown next are “primitive recursions” 
(IM Chapter IX); but the theory extends to more complicated sorts of recursions, all 
defining “general recursive functions” as already mentioned (IM Chapter XI). 

1,0 This is shown in IM Chapter XIII, by using only the one symbol | in proving that 
every general recursive function is Turing computable, while allowing j symbols 
s u . . . , s, for any j > 1 in proving that every Turing computable function is general 
recursive. 

The same method shows that we could have allowed a 2 way infinite tape without 
getting more (or less) computable functions. 
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started at the very beginning, and we have done it with a Turing machine 
with only one symbol “|” (besides the blank), though we didn’t need to. 
Use of more symbols would make the Turing machine action better 
resemble informal computation. 

Moreover, our illustration may unduly suggest that the computer is 
restricted to take an ant’s eye view of his work, squinting at one square 
at a time. However, when more than the one symbol “|” (besides the 
blank) is allowed, it is possible to interpret what we mean by a symbol 
liberally. There is no reason why individual tape squares cannot be 
considered to correspond to whole sheets of paper, each ruled into 
finitely many squares each of which may receive one of finitely many 
primary symbols. What can be written on the whole sheet can then be 
construed as a single symbol for the Turing machine. If the sheets are 
ruled into 20 columns and 30 lines and 99 primary symbols are allowed, 
there are 10 1200 square conditions s 0 , . . . , s, (; = lO 1200 - 1), and we are 
at the opposite extreme. The schoolboy doing arithmetic on 8|" x 11" 
sheets of ruled paper would never need all this variety. He would usually 
operate just by changing the condition of the sheet of paper before him 
(= the scanned square), only occasionally transferring figures from one 
sheet to the next or preceding sheet in his stack of paper (= moving right 
or left). 

From this point of view, the tape square represents whatever one looks at, 
at a given moment, which may not be small. 

Actually the schoolboy at a given moment would directly perceive only 
a part of what appears on the sheet of paper; the rest, to the extent it 
■affects his act from that moment to the next, he must remember in the form 
of a state of mind. Thus, psychologically, it is something between the 
small squares on the paper and the whole sheet that plays the role of the 
Turing machine tape square. 

Another representation of the Turing machine tape is a stack of IBM 
cards, each card constituting a single square for the Turing machine. 

Returning to the schoolboy, it is necessary for our concept that the 
paper be ruled in squares (at least imaginary squares), and that the symbols 
be from a given finite list, so as to make it exact what the whole finite 
range of possibilities is. That is, there must be only finitely many possible 
conditions for a sheet of paper or a Turing machine tape square. Likewise, 
there are to be only finitely many states of mind for the human computer 
or machine states for a given Turing machine. As Turing wrote, “the 
number of states of mind which need to be taken into account is finite. . . . 
If we admitted an infinity of states of mind, some of them wili be 
‘arbitrarily close’ and will be confused.” The finite numbers of square 
conditions and of states of mind can of course be very large. We are 
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considering digital computation, where the data are discrete. This is in 
contrast to analog computation, such as is performed by a slide rule or a 
differential analyzer, where the data are positions on a scale or scales, 
representing real numbers only to within some limit of accuracy. The 
inputs for our computations are to be natural numbers, or they could be 
something similar, like words in an ordinary language, or formal 
expressions in a formal system; likewise, the output. Discreteness must be 
preserved throughout the whole computation. At each moment, the 
existing (finite) configuration, comprising both the condition of the paper 
or the tape square, and the state of mind of the computer or the machine 
state, must completely determine the act to be performed. Only thus will 
the computation lead to a discrete result, not to something approximate, 
and to such a result that is fully predetermined (when it exists at all) by 
the initial situation and the computer’s instructions or the machine table. 

There is however an intrinsic difficulty that our theory of computation 
must meet. This arises because we define afuncbom/(&)-ta-be-comp.utable 
exactly ifjhcre is a m achine which will compute its value for all choice s 
of a. Thus we must be prepared to carry out computations for arbitrarily 
big values of a. The schoolboy computing on sheets of paper is not 
normally required to work with numbers too big to go on one of his 
sheets. If he had to do this, he also would then be forced to work in a 
groping manner as our machine <3 does in computing values of a+1. 

For further discussion of Chu rch’s thes is , or the Church-Turing th esis, 
we refer to the literatu re. 171 In the next sections, we deal with consequences 
of it. 

171 In IM the attempt is made to summarize all the evidence for Church’s thesis: 
§ 62 gives a general summary, supplemented by p. 352; and § 70 gives the part of the 
evidence which pertains to Turing’s machines, supplementing the present discussion. 

The following argument is given in IM, concluding on p. 352. A great stock of 
intuitively computable functions (all that have been investigated in this connection) are 
known to be Turing computable. Likewise, we have a great stock of methods or opera- 
tions (for obtaining new intuitively computable functions from others) which are paral- 
leled by operations for building new Turing machines from given Turing machines (or 
the analog in terms of recursiveness). If there were a function which is intuitively com- 
putable but not Turing computable, it would have to be “inaccessible” by any process 
of building up toward it from this stock of functions and operations already mastered. 
IM § 66 gives a theorem (the recu rsion theorem , Klee ne 1938) a pplicable to the 
building proccss H generarTTisTiard to imagine how one could give a description, or a 
set of instructions, for a computation procedure that a human computer could follow, 
except by putting the description together out of simpler elements, already known, 
and then it would come under this theorem. 

For an argument against Church’s thesis and a reply, see Kalmar 1959 and Mendelson 
1963. In reading such discussions, we must not lose sight of our intuitive concept of an 
algorithm or computation procedure (§ 40). An algorithm in our sense must be fully 
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Exr.Rei.srs. 41.1. (a) Write out the computation by Machine S when 
the argument is 0, i.e. starting from the situation 0 l 1 0 0 0 — 

(b) Explain the operation of Machine S so as to make it clear that S 
does compute the function a+ 1. 

41.2. Construct the table for a machine ® which decides whether the 
argument a is even; i.e. & is to compute the function 

10 if a is even, 

J\ a > — |i jf a j s oc jj 

41.3. Modify the table for the machine S to obtain: 

(a) a machine 3 which computes the identity function f{a) - a (i.e. 3 
just copies a). 

(b) a machine ^ which computes the predecessor function f(a) = a— 1 

(end § 40). , 

41.4*. Show that, if f{a) and g(a ) are Turing computable, so is the 

function h(a) —f(g(a)). 

41.5. Show that no function f(a) would be Turing computable under 
the above definition modified to allow only a finite tape in the machine. 

41.6*. Show that no more functions f(a ) would be Turing computable 
under the above definition modified so as not to require the scratch work 
to be erased. 


and finitely described before any particular question to which it is applied is selected. 
When the question has been selected, all steps must then be predetermined and per- 
formable without any exercise of ingenuity or mathematical invention by the person 
doing the computing. None of the skeptics about Church’s thesis has come forward 
with a plausible suggestion of what an algorithm (in our sense) which would not be 
mechanizable in Turing’s manner could be like. (Of course. Church’s thesis would be 
disproved, if someone should describe a particular function, authenticated unmistakably 
as “effectively calculable” by our intuitive concept, but demonstrably not general 

recursive.) , , 

We have been assuming without close examination the converse of church s 
mtsis: If a function is Turing computable (or genera! recursive , or A- definable ), then it is 
intuitively computable (or effectively calculable). In defending this implication to an 
mtuitionist, or to any other kind of constructivist who considers an algorithm to exist 
only when it is proved by his standards that it always works, we only ask him to accept 
the following- if the hypothesis that a function is Turing computable holds by his 
standards, so does the conclusion. Put thus, it is hard to see how it can be questioned. 
Only if one allows a nonconstructive interpretation of the hypothesis, and yet insists 
on a constructive interpretation of the conclusion, is the converse of Church’s thesis in 
doubt. (“Church's thesis" is sometimes understood to include this converse, as indeed 
Church 1936 maintained both implications in proposing to identify the effectively 
calculable functions with the general recursive functions or the A-definable functions.) 
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§ 42. Church’s theorem (via Turing machines). 172 We have seen 
that the pattern of behavior of a given Turing machine is determined by 
the table for it; if we know the table, essentially we know the machine. 

As we remarked in §41, the use of “/ ) ” for “print” is ambiguous when 
there is more than just the one symbol “|” or s t ( j = 1). We now write the 
table entries using the other method explained there, which is suitable for 
any number j > 1 of symbols s u . . . , s ; . Under this method, a table entry 
“7/J3” would mean that the square scanned at the given moment t is at 
the next moment t + 1 to have Condition s 7 (i.e. it is to have the symbol s 7 
printed on it), and (as before) the machine is to shift the tape between 
Moments / and /+! so that at Moment f+l the scanned square is next 
to the right of the square scanned at Moment /, and at Moment ?+l the 
machine is to be in its state numbered 3. This table entry could appear 
only in the table for a machine with at least 7 symbols and at least 3 active 
states. 

Rewriting the table for our machine S in this manner, it becomes: 

Machine Scanned square 
state condition 

0 1 

1 0C0 \R2 

2 0/?3 IR9 

3 1 £.4 l/?3 

io oco o/m 

u ico i/m 

The table for a machine can be written in code form. Consider the table 
for 3, originally given in § 41 and repeated now. Using the latter way of 
writing it, let us insert semicolons at the end of each row oi entries, and 
commas separating entries within a row, and then string the entire body 
of the table along as one sequence of symbols: 

OCO, 1 R2 ;0/?3, 1 R9 ; 1 L4, 1 /?3 ; ... ;0C0,O/?l 1 ;1C0,1/?1 1 

This sequence of symbols is the code for the machine 3. 

The code for any machine can thus be written on a typewriter with the 
following 15 symbols: 

LC R, ;01 23456789 

IT - In this anil the next two sections we follow the treatment in Klecnc 1958 pp. 
145-147 (some of which appeared earlier in 1956a, 1957b). 
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Such a code does not begin with the symbol L. By reinterpreting these 
symbols as the digits of a number in the number system based on 15, we 
get a positive integer which describes the machine table and thence the 
pattern of behavior of the machine; call this number the index of the 
machine. 173 

Now let T(i, a, x) stand for the following: 

i is the index of a Turing machine (call it “Machine 902/’) which, 
when applied to a as argument, will at Moment x (but not earlier) 
have completed the computation of a value (call that value 
>,(")”)• 

This predicate (i.e. propositional function) T(i, a, x) is “decidable”. For 
suppose values of /, a, x are given. Then we can determine whether t, in the 
15-system of notation, does describe the table for a machine. If it does not, 
T(i, a, x) is false. If it does, then we can follow out the operations performed 
by that machine W, (as in the illustration in § 41), starting it at Moment 0 
to compute for a as argument, and continuing to Moment x. Finally, in 
this case, we can see whether at this moment 3M; has just completed the 
computation of a value. If so, T(i, a, x) is true; if not, false. (For example, 
if i is the number shown above in the 15-notation, then T(i, 1, 23) is true, 
but T(i, 1, x) is false for every x ^ 23.) 

This should make it clear that T(i, a, x) is decidable in the vague intuitive 
sense (§40). It is then implied by Church’s thesis, or the Church-Turing 
thesis (§41), that it is (Turing) decidable in the strict sense that a Turing 
machine exists which decides it, i.e. computes its representing function 
r (/', a , x), whose value is 0 when T(i, a, x) is true and 1 when T(i, a, x) is 
false (§40). A full treatment of the subject would call for showing this 
without appeal to Church’s thesis. One would not do this from scratch, 
but by taking advantage of theory developed for this purpose, and alluded 
to in §41. In this way we would establish the Part (A) of the following 
theorem. (In this chapter we are only giving a survey, in which sometimes 
we will only be able to describe the considerations on which the full proofs 
are based.) 

173 We are using the “method of digits” end § 32 (and IM § 1), except not bothering to 
close up the gaps in the resulting numbers (which there is no need for us to do for our 
present purposes). 

We have chosen this particular method of indexing, in expositions since 1956a, as 
being easy to explain. If we were to deal now with the subjects below in greater detail, 
it would be advantageous to select a method to make the work as easy as possible. Some 
other methods have been more commonly used in the literature. But work done in 
Smullyan 1961 (not quite in the present connection) establishes that the present system is 
no harder to use. 
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Theorem I. (A) The predicate T(i, a, x) is decidable. § 

(B) (f\{a) as a partial function of i and a is computable. 

To explain Part (B) of the theorem, first observe that the “quantity” 
(f t (a) in the definition of T(i, a , x) is not defined for every / and a; indeed 
it is defined, for a given / and a, exactly if there exists an x such that 
T(i, a, x), or in symbols exactly if ( Ex)T(i , a, x). In Chapter II we would 
have written this “3a :T(i, a, x)", but here we prefer to save “3x” for use 
in formal systems, while employing “(£*)” informally in discussing those 
systems. 1115 

For a value of / which is the index of a Turing machine that computes 
a one-place number-theoretic function, <p { is the function computed. Such 
values of i are described by (a)(Ex)T(i, a, x), where “( a )” means “for all 
a”. 165 

As a function of / and a both, (pfa) as we remarked is defined exactly if 
( Ex) T(i, a, x). So it is a partially defined number-theoretic function of two 
variables i and a, or briefly a partial function. 

However, for i and a for which it is defined, we can find its value, thus: 
given such i and a, from / we find the table for Machine then we 
apply (or imitate) 3W, by performing its steps, starting at Moment 0 with 
a as argument, up to the moment x for which T(i, a, x) is true, and finally 
we read from the resulting situation the value computed. This process 
is an algorithm or decision procedure in the sense of § 40; here the count- 
ably infinite class of questions is “What is the value of (pfa)T' where (/, a) 
ranges, not over all pairs of natural numbers, but over exactly those pairs 
for which ( Ex)T(i , a, x). 

We extend the definition given in §41 for “total” number-theoretic 
functions to say now that a partial number-theoretic function of two 
variables is computed by a Turing machine, if that machine computes the 
value of the function for just those pairs of arguments for which the func- 
tion is defined, and computes no value for other pairs of arguments. 
Similarly, for //-place partial functions for any n. The Church-Turing 
thesis applies to partial functions on the same grounds as to total func- 
tions (§ 41 ). 174 " 

Via this extension of the thesis, it follows from our having an algorithm 
for finding the values of <p t {a) that there is a machine U which computes 
<f ,(o) as a partial function of i and a. Turing showed directly, without 
appeal to the thesis, that such a machine U exists, though in a little different 
situation (he dealt with decimal expansions of real numbers); and we can 
do so in our situation. Doing so constitutes the full proof of Theorem I (B). 

A machine U which computes <p,(a) as a partial function of / and a we 

171 For a fuller discussion, cf. IM pp. 331-332 and § 68. 
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call a universal machine, since it can be used to compute any computable 
function <p(a). To use it to compute </>(a), say that <p(a) is computed by 
Machine 'JJl,; then apply U to compute for i, a as a pair of arguments. 
Thus i plays the role of a set of instructions or program for U which tells 
H what function of a to compute. 

The next theorem we can prove in full detail. 


Theorem 11 . The function y>(a) defined by 


y>{a) = 


Tafi) 4- 1 if 
0 otherwise 


is not computable. 


(Ex)T(a, a, x), 


Proof. Suppose y> (a) were computable; say machine computes it, 
so that y>{a) — (pfa) for all a. Substituting p for a, 

y>(p) = Tpip)- 

But since computes yj(a), we have, for all a, ( Ex)T(p , a, x), and in 
particular ( Ex)T(p,p , x). Using this in the definition of y>(a). 


y>(p) = <Pp(p) + i- 

The two displayed equations contradict each other. 

To state the proof a little differently, we can consider each Turing 
machine 9J1„, and see that it will fail to compute y>(a) correctly for a = p, 
as follows. To begin with, SR P may fail to compute any value for p as 
argument. But if does compute a value for p as argument, that value 
is <p„{p), by the definition of <p,(a ); and in this case ( Ex)T(p,p , x ), so that 
the correct value of y>(p) is <p„(p) + 1 by the definition of y >{a). — 

The significance of this result comes from the Church-Turing thesis, by 
which computability in Turing’s sense agrees with the intuitive notion of 
computability. Accepting the thesis, as most workers in foundations do, 
the director of a computing laboratory must fail if he undertakes to design 
a procedure to be followed, or to build a machine, to compute this func- 
tion y(a). This refutes the notion, which news reports on modern develop- 
ments in high-speed computing tend to foster in the public mind, that 
machines can do everything. The theorem does not assert that there is any 
particular value of ip{a) that we cannot learn. But in whatever model we 
freeze the design of a computing procedure or Turing machine, we will be 
short of having a procedure or machine that can compute all values of 
f(a). If the values it computes are correct, there must be some values it 
cannot compute; in particular, it cannot compute the value tp(p) where p 
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is its own index (or for a procedure, the index of a machine which mech- 
anizes the procedure). To improve the pro cedure or machine m ust Jake 
ingenuity^. xomethjngjhat cann ot be h uilt into the machine . 

It should have been apparent that there must be uncomputable number- 
theoretic functions, as soon as we met the Church-Turing thesis in §41, 
by which the set of different possible machines is countably infinite because 
each machine is dcscribablc by a finite table in a fixed symbolism (cf. § 32). 
The set of the machines being countable, the set of the functions com- 
putable by machines is countable, while the set o f all t he number-theoretic 
functions is uncountable (§ 33). But it remains of interest to see how simple 
examples of uncomputable functions we can give. Our example (f(a) in 
Theorem II) is really quite simple, as it is obtained by completing the 
definition (by the “0 otherwise”) of a suitable computable partial function. 

Why can’t we compute y>(a) from its definition ? Answering this question 
gives the next theorem. 


Theorem III. 
function 


The predic ate (Ex)T(a. a , x) is undecidable; i.e. the 


'M = 


0 

,1 


if ( Ex)T(a , a, x), 
otherwise 


is uncomputable. 

Proof. If we could decide ( Ex)T(a , a, x), we could compute the 
function y>(a) of Theorem II thus: Given a, decide whether ( Ex)T(a , a, x) 
or not. If this decision gives “yes”, imitate the behavior of 93t a for a as 
argument to compute f a (a), and add 1 to the result. If this decision gives 
“no”, simply write 0. 

Again, as in proving Theorem I, we are giving the idea of the proof, not 
the full technical details. What needs to be supplied is the hypothetical 
construction of a machine to compute y(a) from one (supposed given) to 
compute x(a). This is pretty easy to give, once one has reached the stage 
in the development that he would have reached in proving Theorem I. 

Discussion. Theorem III is essentially Chu rch’s theorem, which 
appeared with his thesis in his 1936 paper entitled ‘^An unsolvable problem 
of elementary number theory”. The difference is that we have given an 
example in terms of Turing computability, whereas Church’s example was 
in terms of “/.-definability” (beginning § 41). The problem that is “unsolv- 
able” is to find a decision procedure for the predicate ( Ex)T(a , a, x). Of 
course the problem is solved in another sense, by its being shown that 
there cannot be the required decision procedure. The problem of trisecting 
the general angle by a ruler and compass construction is unsolvable in one 
sense; but in another is solved by its being shown that the required con- 
struction cannot exist. 
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We note the especially simple logical form of the undecidable predicate, 
namely (Ex)T(a, a, x), where as an almost immediate corollary of Theorem 
1 (A) T(a, a, x ) is a decidable predicate. This is what is accomplished in 
Church’s theorem, compared to simply contrasting the nonenumerability 
of the set of all (extensional) number-theoretic predicates (given by Cantor’s 
diagonal method) with the enumerability of the set of the computable 
number-theoretic predicates (given by Church’s thesis). 164 

Exercises. 42.1. Show that there is no algorithm for deciding whether 
a given Turing machine started in a given situation eventually stops (the 
•’halting problem” for a Turing machine). 

42.2. Find an undecidable predicate of the form ( x)P{a , x) where P{a, x) 
is decidable, (“(a:)” means “for all x".) 

42.3. Show that the function /(a) of Exercise 40.1 (a) (often written 
” exP(a , a;)”) is not computable when P(a, x) = T(a, a, x). 

42.4*. Show that there is no algorithm for telling whether, for given i 
and j, (p£ci) and q:,(a) are the same partial function (i.e. whether, for each a, 
both functions have either the same value or no value). 

42.5*. Show that the computable partial function of Theorem l (B) 
cannot be extended to a computable total function; i.e. there is no com- 
pletely defined number-theoretic function <p(i,a) such that: 

{i)(a)[(Ex)T(i, a, a;) — <p(i, a)=<p l (a)}, and <p is Turing computable. 175 

§43. Applications to formal number theory: undecidability 
(Church) and incompleteness (Godel’s theorem). Recall that in the 
foregoing we have been using “decision procedure” and “decidable” (i.e. 
"a decision procedure exists”) in relation to some given countably infinite 
class of questions (§ 40). Thus in Theorem I (A), the questions are “Is 
T(i, a, x) true?” for the various values of /, a, x (/, a,x — 0, 1, 2, . . .); and 
in Theorem III the questions are “Is (Ex)T(a, a , ar) true?” for the various 
values of a (a = 0, 1, 2, . . .). In Theorem IV below, we shall infer from 
Church’s theorem (Theorem III) that the decision problem fo r the system 
N of formal number theory of § 38 is “unsolvable” (or is solved in the 
negative); here the Questions are “Is A provable in N?” where A ranges 
over all the formulas of N. In Theorem II, which concerns “computation 
procedure” and “computable”, the questions are “What is the value of 
V(a)?” for the various values of a (a = 0, 1, 2, . . .). 

It should also be recalled that “decidable” or “computable” have both 
a vague intuitive sense (§40), and an exact sense whi c h we h a v e, d e fined 
via Turing machines (§ 41)., The Church-Turing thesis, and its converse 
that every Turing computable function's intuitively computable, assert 

1,5 Using recursiveness, with <!>(/, a) corresponding to the present the solution 
is in IM p. 341 following Theorem XXII. 
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that these two senses are equivalent. We understand our theorems now to 
be proved using the second sense, which requires more of us when we are 
establishing decidability or computability, and is the only sense in which 
they can be proved when we are establishing undecidability or uncomput- 
ability. But by the Church-Turing thesis, the latter theorems have their 
significance in terms of the intuitive sense of those terms. 

We are now in a position to achieve our goal (§ 40) of showing there is 
no decision proce dure for N (Theorem IV). Continuing The analysis a 
little further, wtTshall obtain G6dcl\Janious_|ncoinplctencss theorem in 
a generalized version, as well as his second theorem (in § 44), which will 
clarify the situation described at the end of § 38. 

We defined the predicate T(i, a, x) (§42) informally in quite elementary 
terms, though its definition fully written out (with our explanation of 
Turing machine tables and the indexing of them) is long. It would therefore 
be disappointing if T(i, a, a;) could not be expressed in the symbolism of N. 
Indeed, if it could not, the first part of our claim of the adequacy of N 
for the usual elementary number theory (end § 38) would be false. For, 
though 7 ’(/, a, x) is not commonly included in number-theoretic texts, 
there is nothing in its definition to prevent it from being included. In fact, 
the results cited in § 38 do enable us by established methods to find 
a formula T(i, a, x) (containing free exactly three distinct variables i, a, x) 
in the symbolism of N which expresses T(i, a, x) under the intended inter- 
pretation of the symbols. 176 The like, as we have just argued, would have 
to be the case for any formal system N which we would consider adequate 
for the usual elementary number theory. 

In N, the particular natural numbers 0, 1,2,... are expressed by the 
respective terms 0, (0)', ((0)')', . . . (which in § 38 we abbreviated “0”, “1”, 
“2”, . . .); these terms we call the numerals ( for the respective natural 
numbers 0, 1,2,.. .). For any natural number a, we denote the numeral 
for a by "a". Now, for each a = 0, 1 , 2, .... the proposition ( Ex)T(a , a, x) 
is expressed under our interpretation of the symbolism of N by the formula 
3xT(u, a, x). Call this formula “C, Similarly, in any adequate formal 
system of elementary number theory, given any value of a, we can find 
effectively (end § 40) a closed formula C tt expressing under the intended 
interpretation the proposition ( Ex)T(a , a, x). 


j: “ To (five somewhat more specific references for this than in Footnote 155 § 38, one 
can show" by 1M that T(i, a, s) is primitive recursive (Chapter IX), using the technique 
of § 69, and then apply Corollary Theorem I p. 242. The predicate written “(£)/) 7',(.r, x,y)" 
in Chapter XI pp. 281 If. plays a role for general recursiveness analogous to that of the 
(Ei)T(a, a, J-) of this book for Turing computability. (In some papers “7V’is written 
simply the notation goes back to Klcene 1936, which slightly antedates Turing 

1936-7.) 
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If, for a given a, the proposition ( Ex)T(a , a, x) is true, then we can prove 
it informally, by the mechanical process of exhibiting the computation 
steps of the machine 9Jt 0 applied to a up to the moment x at which a value 
will have just been computed. This will prove T(a, a , x) for that x , and 
(. Ex)T{a , a , x) will follow by informal 3-introduction. Now this informal 
proof can also be carried out within N (i.e. it can be “formalized in N”). 
Thus 

(a) (Ex)T(a, a,x)^{bC a in N}. 

Of course to show this would require a detailed investigation of the proof 
theory of N, which we are not giving in this book. But if it were not the 
case, the deductive apparatus of N (i.e. the list of its axiom schemata, 
axioms, and rules of inference) would be inadequate for the usual elemen- 
tary number theory, contrary to the second part of our claim in § 38. 177 

Now let us assume about N that in it only tr ue form ula s are provable. 
Since under our interpretation C a expresses (Ex)T(a, a, x ), this gives in 
particular 

(b) { f- C a in N} ->■ (Ex)T(a, a, x). 

In the next section we shall see why at this stage we simply assume (b). 
Clearly, if (b) were not so, we should reject N as a formal system for 
number theory. We certainly believe (b); indeed a proof of sorts, though 
not a finitary one (§ 36) and thus not one in metamathematics, is available, 
as follows. Under the usual interpretation, the axioms of N are true; and 
each of the rules of inference, when applied to one or two true formula(s) 
as premise(s) produces a true formula as conclusion. Thus all provable 
fo rmu las are true. Hence, (b). (Cf. § 38 (B).) 

Theorem IV. There is no decision procedure for provability in the 
formal system N o/§ 38; or briefly, N is undecidable. 

More generally, th is applies not just to the formal system N o/~ § 38, but 
to any formal system N in which, to each a, there can be found effectively a 
closed formula C a such that (a) and (b) hold. 

Proof. Suppose there were a decision procedure for provability in 
N. Then we could, given a, decide as follows whether ( Ex)T(a , a, x) or not, 
which would contradict Theorem III. Given a, find (as we can effectively) 
the formula C a , and apply the assumed decision procedure for provability 
in N to settle whether this formula C a is provable. By (b) and (a), according 
as C a is or is not provable, ( Ex)T(a , a, x) holds or does not hold. — 

177 If the method of finding T(i, a, x) proposed in Footnote 176 is used, then (a) will 
follow by IM Corollary Theorem 27 p. 244 and 3-introduction. 
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Preparatory to the next theorem, we further apply o ur suppositio n th a t 
in N only t rue formulas are p ro vable to th e formulas “ iC„ (a — 0, 1, 2, . . .). 
Thus we assume (where “ expresses “not”) 165 

(e) { f- -iC a in N} -> (Ex)T(a, a, x). 

The same remarks we made about (b) apply to (c). 

We now ask whether we can prove -iC ,, when it is true: that is, whether 
the converse of (c), namely 

( *) {Ex)T{a, a, x) -> { f -nC„ in N}, 

also holds. If it did, we should have a decision procedure for ( Ex)T(a , a, x) 
as follows, again contradicting Theorem III. First, observe that all the 
proofs in the N of § 38 can be written on a typewriter with the 41 formal 
symbols of N and the comma to separate successive formulas in a proof, 
making 42 symbols in all. So the proofs are enumerable (say using the 
method of digits, end § 32), and indeed effectively (since there is a decision 
procedure for being a proof, § 40). So, given a, we could search through 
an enumeration of the proofs in N for one of C„ or of ~ iC„. By th e classical 
law of the exclud ed middle with (a) and (*)- we shall find one or the other. 
By (b) an d (c l . according to whic h we. fi nd we shall learn that (Ex)T(a, a, x) 
js jrue or not true . 

Thus (*) does not hold for all a, which gives us the next theorem. 

Theorem V. In the system N of § 38 there is a closed formula C„ such 
that (i) — iC p is true, (ii ) not I — iC„ in N , and (iii) not P C v in N. 

More generally , this applies to any formal system N in which, to each a, 
there can he found effectively a dosed formula C a ( expressing ( Ex)T(a , a, x)) 
such that (a)-(c) {or (b)-(d) below) hold. 

First proof, completed. Since (*) does not hold for all a, there is 
some number p such that {Ex)T(p,p, x ) (i.e. (i), by what -iC, expresses) 
but not I- -iCj, in N (i.e. (ii)). By {Ex)T(p,p, x) with (b), then not h C v in 
N (i.e. (iii)). 

Remarks. This gives us Godel’s famous incompleteness theorem 
1931, generalized to apply to all formal systems N satisfying very general 
conditions, and with the “formally undecidable sentence” C p expressing 
the value of a preassigned predicate ( Ex)T{a , a, x) for an argument p 
depending on the particular system. This generalized form of Godel’s 
theorem (with a predicate written “(Ex)T(a, a, x)”) is due to Kleene 
1943. 178 

% 

1,8 Kleene used “g enera L recursive function s” (§41) instead of Turing machines. The 
first use of Church’s thesis to give a generalized version of Godel’s theorem was by 
Kleene in 1 936. ~~ — 
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In a formal system with notation like that of N, we say a formula E is 
formally decidable if E E or E ~iE; and we say the system is simply com- 
plete, if every dosed formula E is formally deddable. Thus by (iii) and (ii) 
of Theorem V, N is simply incomplete, with C„ as an example of closed 
formally undecidable formula. This notion of “formal decidability” applies 
to a particular formula, while decidability in the sense of §40 or of §41 
applies to an infinite class of questions (or propositions). We restrict the 
E in the definition of simple completeness to be closed, because e.g. we 
would not wish to have E 2|« or I — 12|^ (cf. § 38). For, under the 
generality interpretation (which applies to free variables of provable form- 
ulas, § 38), 2 | a as a provable formula would say “all natural numbers 
are even” and — i2j<^- would say “all natural numbers are odd”. We like- 
wise confine the definition to systems with notation like that of N, excluding 
systems like the propositional and predicate calculi, because e.g. neither 
E P 3 Q nor E ~i(P D Q) in those calculi. Although PDQ is a closed 
formula, its atoms P and Q function in the definition of validity like vari- 
ables ranging with the generality interpretation over all propositions. 

The foregoing proof of Theorem V is indirect, the existence of the p 
being inferred from the absurdity of (*) holding for all a. We now give a 
direct proof. 

Second proof of Theorem V. Let 901,, be a Turing machine which, 
applied to a as argument, searches through an enumeration of the proofs 
in N for one of ~iC 0 , and if one is found writes 0, but otherwise never 
computes a value (indeed, never stops). Considerations employed in similar 
situations above should make it clear that such a machine 311,, exists. A 
detailed proof for the particular N of § 38 is available on the basis of the 
developments in Turing-machine theory to which we referred late in § 41 
and early in § 42. From what the machine 90l„ does, with the definition of 
T(i, a, x), 

(d) (Ex)T(p, a, x) = { E -iC„ in N}. 

Now (b)-(d) give us the three parts of Godel’s theorem, thus. 

(i) Suppose ( Ex)T(p,p , x); then by (d) 

Thence by (c) 

(Ex)T(p,p, x). 

This contradicts {Ex)T(p, p,x)\ so by reductio ad absurdum, 

(Ex)T(p, p, x), i.e. -iCp is true. 

(ii) By (i) "iC, is true, i.e. ( Ex)T(p,p , x). Thence by (d) 

not E„ -iC,,. 
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(iii) Suppose then by (b) 

(Ex)T(p,p, x), 

i.e. C„ is true. This contradicts (i); so not f N C„. 

Discussion. Here we have used the feature of formal systems, essential 
to the purposes which they arc intended to serve (§§ 36, 37), that a proof 
of a formula can be effectively recognized as being such (and also that C„ 
can be effectively found from a). Without this feature, we would have a 
trivial counterexample to Theorem V by taking all the true closed formulas 
as the axioms of N. With it, by Church’s thesis we conclude that, for any 
such system, an exists. 179 Here the computability notion can be 
applied directly to the linguistic symbolism of the system, or that sym- 
bolism can be converted to natural numbers, e.g. by the method of digits 
with Footnote 173, as we have already done with machine tables. The 
numbers correlated to the linguistic objects are called Godel numbers , and 
the correlation is called a Godel numbering, after Godel, who introduced 
this device in 1931. 180 

The application of Church’s thesis, by which we obtain Theorem V for 
all systems N, can be avoided for a particular system by actually construc- 
ting the 9J{„ for it. This in effect Godel did in 1931* in proving his theorem 
for a particular system before Church’s thesis had appeared (1936); 181 and 
this, as we have implied, can be done by known methods to get the theorem, 
without leaning on Church’s thesis, for the particular system N of § 38. 

Substituting p for a in (d) and negating both sides (as we already did 
in proving (.ii)), 

(d) ( Ex)T(p,p , x) = {not h “iC,, in N}. 

Remembering that under the interpretation “iC„ expresses ( Ex)T(p,p , x), 
we thus have that iC„ is a formula which, under the interpretation, expresses 
(a proposition equivalent to) its own unprovability. This was the point of 

179 “'In particular, the nature of the intuitive evidence for the deductive processes 
winch arc formalized in the system plays no role. 

“Let us imagine an omniscient number theorist, whom we should expect, through his 
ability to see infinitely many facts at once, to be able to frame much stronger systems 
than any we could devise. Any correct system which he could reveal to us, telling us how 
it works without telling us why, would be equally subject to the Godel incompleteness.” 
(KJeene 1943 p. 65.) 

1 '" Godel used another method of numbering in 1931 ; and Hilbert and Bernays 1939 
and IM use stdl another method. In 1931-2, Godel used what we are calling the “method 
of digits”. This type of numbering is studied in detail in Smullyan 1961, which gives a 
fresh approach to formal systems from ideas of Post 1943. 

In the period immediately after Godcl’s results appeared, there was some un- 
certainty among logicians whether there might not be an escape from them by using 
some other particular system quite different in its details from the system Godel used. 
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departure in Godel’s original proof, or at least in the heuristic explanation 
he gave for it; namely, he constructed a formula expressing its own unprov- 
ability. This is very close to the paradox of the Liar (§ 35), where we have 
a sentence expressing its own falsity. Only now, by Godel’s substitution of 
• 'unprovability” for “falsity”, there is a way out. We wish (and believe in 
the case of N) that all provable formulas are true. Then, if all true formulas 
were provable, we would have “unprovablc = false”, so we would have 
the paradox of the Liar. The way out now is that not all true formulas are 
provable; in particular. ~iC., is improvable though true . 182 

The first part of Hilbert’s program ( §§ 36, 37) called for formalizing -j- 
nu mber theory, analysis, and a suitable part of set theory in a forma l v 
syste m S. GbdcTs theorem shows this cannot be done completely ev en for 
n umber theory . For -iC p expresses a number-theoretic proposition which 
by the theorem is true, yet unprovable, provided of course S satisfies the 
hypotheses for the N of the theorem. However these hypotheses are con- 
nected with the purposes for devising formal systems, so we have no 
prospect of escaping them. These hypotheses are simply consequences of 
the st ructural feature of for mal systgjps discussed above and of the 
supposition that N is adequate and correct for some elementary number 
theory. 

We do not consider that Theorem V means we must give up our emphasis 
on formal systems. The reasons which make a formal system the only 
accurate way of saying explicitly what assumptions go into proofs are 
still cogent. Rather, Theorem V indicates that, contrary to Hilbert’s pro- 
gram, the path of mathematical conquest (even within the already fixed I ' 
territory of arithmetic) shall not consist solely in discovering new proofs 
from given axioms by given rules of inference, but also in adducing new ' 
axioms or rules. There remains the question whether mathematicians can 
agree on the correctness of the new axioms or rules. T 

In Theorem V, no sooner are we aware that — iC„ is unprovable than we 
also know that - iC p is true, so we can extend N (call it “N 0 ”) by adding 
-iC„ as a new axiom. But then Godel’s theorem will apply to the extended 
system N 1; and we shall have in this system a true but unprovable formula 
-iC v 1 he process can be repeated to obtain successively stronger. formal 
systems N 0 , N x , N 2 , . . .. We can combine these to form another formal 
system, if these systems are formed systematically enough so that after they 
are combined it will be decidable which formulas are axioms, and hence 

i 82 Nagel and Newman 1956, 1958 give a popular exposition of Godel's theorem along 
the lines of Godel's original proof 1931. They give a misleading impression that the 
generalized Gddel theorem is implied by Godel’s 1931 reasoning and thus without the 
Church-T uring thesis. 

In Popper 1954, Theaetetus explains Godel’s theorem to Socrates. 
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which finite sequences of formulas are proofs (otherwise we could not 
consider the result a formal system). Then, starting from this system, we 
can again use the extension process based on Godel’s theorem. Proceeding 
in this way does not provide an escape from the consequences of Godel’s 
theorem. 183 


§ 44. Applications to formal number theory: consistency proofs 
(Godel’s second theorem). In establishing (i) in Theorem V we have 
proved that -iC„ is true, though by (ii) ~iC P cannot be proved in N. It is 
illuminating to consider wherein our intuitive proof of (the truth of) — lC,, 
^transcends the resources of N. 

In our second proof of (i) in Theorem V we used (c), which we regarded 
as an assumption. Using (a), this assumption (c) can be derived from the 
assumption that N is simply consistent, i.e. that for no formula E do both 
f E and b -iE hold in N (Exercise 44.1 (a)). 

Alternatively, we can recast the second proof of (i) to use simple consis- 
tency directly instead of (c), thus. 

(i) Suppose ( Ex)T(p,p , x)\ thence by (a) and (d) 

and fjj ~ tCj,, 

contradicting the simple consistency. So by reductio ad absurdum, 
(Ex)T(p,p, .r), i.e. -iC„ is true. 

In either version of the second proof of (i), the part written out in detail 
is entirely elementary; indeed, it is only informal use of the predicate 
calculus. Also the reasoning by which (a) and (d) can be established for 
the N of § 38, or a similar particular system, is elementary, on the level of 
informal number theory, though it is quite long when executed in full 
detail. There remains (c) or the simple consistency as the sole component 
not known to be elementary. 

In summary, our informal proof of (i) is entirely elementary except for 

Classes of such systems are called "ordinal logics” because the systems are indexed 
by finite and infinite (or “transfinite”) “ordinal numbers”. Ordinal logics have been 
studied by Turing 1939, Feferman 1958 abstracts, 1962, and Kreisel 1958c. 

The familiar natural numbers serve both as finite cardinals (e.g. "There are 10 houses 
on this street.") and as finite ordinals (e.g. “He lives at 10 Downing Street.’ ). Cantor 
gave a theory in which ordinal numbers can be defined as equivalence classes of “well- 
ordered” sets under the relation of admitting a 1-1 correspondence preserving the 
ordering. A well-ordered set is a "linearly ordered” set each nonempty subset of which 
possesses a least element, A linearly ordered set is a set S together with an “order” rela- 
tion < which is irretlexivc, transitive, and total (for each a and b in S, 
a < A V a = /) V a > h). Under an obvious definition of < for ordinal numbers, the 
ordinal numbers themselves arc well-ordered. 

Ordinal logics use the theory of ordinal numbers in a constructive or computable 
version, due to Church and Kleene 1936, Church 1938 and Kleene 1938. 
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our having to use in it (c) or the simple consistency of N. Thus it is indi- 
cated that (c) or that simple consistency is the respect in which it trans- 
cends N. 

Preparatory to formulating this more carefully, we note for reference 
that the following can be shown in an entirely elementary way: 

(1) {N is simply consistent} -*• ( Ex)T{p , p , x). 

The proposition that N is simply consistent can be expressed in the 
symbolism of N. Let "JJi r be a Turing machine which, applied to any num- 
ber a , searches through an enumeration of the proofs in N fora proof of a 
formula of the form E & ~iE, and if it finds one writes 0, but otherwise 
never computes a value. The consistency of N is equivalent to {Ex)T(r, a, x), 
for any a, and in particular to (' Ex)T(r , r, x), which is expressed in N by 
-iC r . Call this formula “Consis”. 184 

Now the informal implication (1) can be translated into the symbolism 
of N as the formula 

Consis 3 ““ iC^, 

since -iC„ expresses ( Ex)T{p , p, x). 

Theorem VI. (Godel’s second theorem.) In the system N of % 38, not 
t- Consis; i.e. the formula Consis expressing in N the consistency of N is 
improvable in N. 

More generally, this applies to any simply consistent formal system N, 
with any choice of a formula Consis expressing in N the consistency o/N, 
such that there can he found effectively, to each a, a closed formula C,„ and 
a number p, for which (a) and (d), and (2) below, hold}** 

Proof (completed). Because (1) is established by elementary in- 
tuitive reasoning, our belief in the adequacy for elementary number theory 
of the system N of § 38 gives us reason to believe that the formula 
Consis 3 "fC,, expressing (1) can be proved in N, i.e. that the informal 
proof of ( 1) in elementary number theory can be formalized in N. A person 
hating some familiarity with the development of number theory in N 

1,1 Alternatively, we can correlate Godel numbers to the objects in the symbolism 
of N (end §43), and take as Consis a formula which directly translates the consistency 
property into a statement about Godel numbers. Cf. 1M p. 210. 

So much has been put into the hypotheses of the generalized version of this 
theorem (the second paragraph) that it only amounts to an application of modus 
ponens to (2) with the fact that simple consistency, (a) and (d) imply Theorem V (ii). 
The significance of this second paragraph is that the hypotheses arc ones that would be 
satisfied by any system of number theory, and any choice of Consis for it, that we would 
normally consider. A f uller treatment of the domain of applicability of Godel s second 
theorem was first given by Fefcrman 1960. 
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(further than wc went in § 38) would find it hard to doubt this. Hilbert 
and Bernays did verify it to be the case. 186 That is, 

(2) f N Consis 3 ~iC„. 

Now suppose h N Consis. By 3-elimination (modus ponens) from (2), then 
h N — iC„, contradicting (ii) of Theorem V. , 

i Discussion. The second part of Hilbert’s program for foundations 
called for showing by finitary metamathematical reasoning that a formal 
system chosen as a formalization of classical mathematics is consistent. 
As the mathematics formalized in N § 38 is not all finitary, the hope must 
have been that the part of the methods formalized in N obtained by 
excluding the nonfinitary ones would suffice for the consistency proof. 
Godel's second theorem shows that not even all the methods formalizable 
in N, i.e. a metalanguage isomorphic to N itself, can prove the consistency 
of N, if N is consistent (as we have been assuming). 

Some mathematicians judged that this ended forever the hope of getting 
Ta guarantee for classical mathematics by a metamathematical consistency 
proof. 

Others thought it possible that methods could be found which could be 
considered as Unitary even though not formalizable in N. Then the set of 
, the finitary methods F and the set of the methods formalized in N could be 
pictured as overlapping circles; the law of the excluded middle for count- 
‘ ably infinite sets (§ 36) would be in N but not in F, while some new finitary 
. method would be in F but not in N. 

I Progress toward proving the consistency of N had been stalled since the 
consistency proof for a subsystem of N by Ackermann in 1924-5 (end § 38). 
Interesting new proofs, but still for essentially the same subsystem, were 
given by von Neumann in 1927, Herbrand in 1931-2, and Gentzen in 
1934 - 5 . After the necessity of using some nonelementary method was 
revealed by Godel’s second theorem in 1931, it was not too long before 
Gentzen in 1936 gave a consistency proof for N.’This employed, as its 
method not formalizable in N, induction over a certain segment of the 
finite and transfinite ordinal numbers, which Cantor had obtained by an 
extension of the counting process or ordinal use of numbers beyond the 
natural numbers, in association with “well-ordered” sets (analogously to 
his introduction of transfinite cardinal numbers in association with un- 
ordered sets, § 34 ). 183 The induction was over the ordinals less than the 
ordinal called “s 0 ” by Cantor. 187 A different consistency proof for N was 

Hilbert and Bernays 1939 pp. 283 ff., especially pp. 300-324. Their work is done in 
a system essentially equivalent to the N of § 38, though using a somewhat different 
choice of Consis and — iC„ than the ones we are basing on Turing machines. 

187 For a little more of an indication, cf. IM pp. 476-478. 
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given by Ackermann in 1940, also using transfinite induction over the 
ordinals < e 0 . Schiitte 1960 gives such a proof in a new, quite perspicuous 
form. Denton and Dreben 1968 greatly simplify the Ackermann proof by 
using ideas from Herbrand 1930. 

It is a rather subjective matter whether this should make us feel safer 
about N than we already felt on the basis of its axioms being true, and its 
rules of inference preserving truth, under an interpretation (“truth defini- 
tion’’) that as classical mathematicians we presumably accept (§ 43 following 
(b)). By a simple reduction of classical logic to intuitionistic logic given by 
Godel 1932-3, Gentzen 1936 and Bernays, the consistency proof by a 
truth definition can even be managed intuitionistically . 188 Tarski, asked 
whether he felt more secure about classical mathematics from Gentzen’s 
consistency proof, replied, “Yes, by an epsilon”. (Calculus students will 
recognize epsilon “e” as commonly used for a small positive number.) 

It is clear that the consistency proofs by induction over the ordinals less 
than the ordinal e 0 work very hard to accomplish something, but less clear 
what. 

^re iseU 1951-2, 1958) finds the sig nificance of the consistency proo fs by 
transfinite induction up to e 0 in by-products. Suppose that, for a given 
natural number /, the formula Va3xT(i, a, x) is provable, where / is the 
numeral for i. Then, assuming that in N only true formulas are provable, 
(. a)(Ex)T(i , a, x) holds, and thus is a computable total function (cf. 
§ 42). It is not hard to see that the computable (total) functions which can 
thus be proved to exist in N constitute a proper subclass (i.e. not all) of 
the computable functions (Exercise 44.2). Indeed Kleene 1936 gave a proof 
of Godel’s incompleteness theorem from this idea. 178 Kreisel however 
extracts from Ackermann’s consistency proof 1940 a different characteriza- 
tion (not directly from N) of this subclass of the computable functions. 
The possibility thus appears that some true formula Va3xT(i, a, x) might 
be shown to be unprovable in N because for that i the computable function 
9, (a) is not in this subclass. 

Gentzen had already claimed in the first (1936) version of his consistency 
proof to have established a property of provable formulas of classical 
formal number theory N that can be regarded as an intuitive interpretation. 
But the property was complicated, and received little attention after his 
proof appeared in another version 1938a easier to follow and not adducing 
the property. 

To see that there is a problem of interpretation, we recall from § 36 that 
Hilbert 1926, 1928 made a distinction between “real” statements having 
a clear intuitive meaning, and other statements called “ideal”. In classical 
mathematics, the “ideal” statements are adjoined to the “real”. One 

199 Cf. IM § 81. A related reduction was given by Kolmogorov 1924-5. 
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T might have supposed that the “real” statements would include all the 
/ statements of elementary number theory. 

However the picture has not turned out to be this simple. For, in elemen- 
tary number theory there are statements proved classically that are not 
true on the basis of their meanings for an intuitionist. Kleene 1943 argued 
this as follows. 

\ The intuitionist understands an existential statement ( Ey)P(y ) to mean 
that one can actually find a y such that P(y). From this standpoint, what 
can (a)(Ey)P(a, y) mean? Only that there is an effective procedure by which, 
' given any a , one can find a y such that P(a, y). By the Church-Turing 
thesis, this must mean that the y is a computable function of a. Thus we 
' are led to the thesis that intuitionistically ( a)(Ey)P(a , y) only if there is a 
computable function g(a) such that ( a)P{a , g(a)). 

By the classical law of the excluded middle (which intuitionists decline 
to affirm), for each a, ( Ex)T{a , a, x) V ( Ex)T{a , a, x ). 

Thence [(Ex)T(a, a, x) & 0=0] V [{EE)T(a, a, x) & 1 = 1], 

Thence ( Ey)[(Ex)T(a , a, a) & y=0] V (Ey)[(Ex)T(a, a, x) & y=\\. 

Thence (Ey){[(Ex)T(a, a, x) & y=0] V [(Ex)T(a, a, x) & y— 1]}. 

This is for each a, so we have proved classically 

(a) {a)(Ey){[{Ex)T{a, a, x) & y=0] V [(Ex)T(a, a, x) & ?/=l]}. 

We have presented this proof informally, but it is a simple matter to formal- 
ize it in the system N of § 38, so that we have 


(P) h N Va3y{[3xT(a, a, x) & y=0] V [-i3xT(a, a, x) & y= 1]}. 


Let us abbreviate (a) as ( a)(Ey)P{a , y). By the above thesis, (a) holds 
intuitionistically only if {a)P(a, g(a)) for some computable function g(a). 
But from whatP(a, y) is in this example, the only g(a) for which (a)P(a, g(a)) 
holds is the representing function of ( Ex)T{a , a, x)\ and by Theorem III 
§ 42, this g(a) is not computable. 

Summarizing: (a) holds in classical informal number theory, and trans- 


— lates into a formula which (as ((3) states) is provable in N, but (a) cannot 
be affirmed as true intuitionistically. 

v Specker 1 949 gave similar examples in which the statements (a) which 
(J- hold classically but not intuitionistically (if the above thesis of Kleene 1943 
is accepted) are instances of well-known theorems of analysis, e.g. the 
theorem that a bounded monotone sequence of rationals is convergent. 
For definiteness, take a monotone nondecreasing sequence. Using varia- 
bles n, b, m, n u n 2 ranging over the natural numbers, and writing/(0),/(l), 
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/( 2), ... for any sequence of rationals, the theorem can be stated thus: J 

V 

(a) («)/(«) </(n+l) (/ is monotone nondecreasing), 

and 

(b) ( Eb)(n ) f(n) <b (/is bounded), 

then 

(c) (»»)(£>!)(«, )(« 2 ){« i, n 2 >n-+ !/(«!) -/(n 2 )| < 1/2"*} (/converges). 

It should be reasonably obvious that the notion of Turing computability 
can be applied to functions /(«) with rational numbers as values, either 
directly or by talking about the natural numbers which index the rationals 
in some fixed enumeration of the rationals § 32. Specker gave a particular 
sequence/(0),/(l),/(2), . . . , for which the function /is Turing comput- 
able, ,8U and such that (a) and (u) hold but 

(C') (/m)(«i)(// 2 ){«i, n 2 > g(m) — |/(n 1 )-/(« 2 )| < 1/2’"} 

holds for no computable function g. Thus, for this particular f (a)-(c) are 
expressible in the elementary theory of natural and rational numbers (which 
is in the common part of the classical and intuitionistic languages of mathe- 
matics), the hypotheses (a) and (b) are true both intuitionistically and 
classically, but the conclusion (c) is true only classically. 

Kreisel’s proposal uses Ackermann’s consistency proof for N to corre- 
late to (a) on the basis of ((3) a considerably more complicated statement , 1 
which is both meaningful and true for a “finitist”. A “finitist”, if not an ! 
intuitionist, is at least of similar ilk. 190 

Formal systems can be set up to study the foundations of intuitionistic 
mathematics, as Heyting did in 1930, 1930a, though the intuitionists have 
always maintained on phil osophical grounds (from before Godel’s theorem 
was known) that such systems cannot be complete. (Cf. ends §§ 12, 25, 39.) 
Kleene, David Nelson and others have since 1941 been appl ying comput - 
able (or general recursive) funct ions to elucidate the diffe rence s between 
int uitionistic and classical formal systems. 1 81 

^Exercises! 44 1. Using (a), show that: (a) the simple consistency of N 
implies (c), and (b) conversely (using weak -j-elimination § 11). 

44.2. Show that h N Va3xT(t, a, x) does not hold for all computable total 
functions <p,(a). (Assume that only true formulas are provable in N.) 

44.3*. Assuming N simply consistent, show that there is a formal system 

l#> In fact, via indices in a standard enumeration of the rationals, / is primitive 
recursive IM Chapter IX. 

1,0 Introductions to these ideas of Kreisel are 1953, 1958. Kreisel’s current thinking 
on a broad range of foundational problems is in 1 965. 

181 Cf. IM § 82, and Kleene and Vesley 1965. 
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M of number theory which is simply consistent but in which not all prov- 
able formulas are even classically true under the usual interpretation. 11,2 

44.4*. Let S be a system satisfying the hypotheses of Godel’s second 
theorem (except not assumed to be simply consistent). Let T be S minus 
some of its postulates. What would have been an appropriate response by 
a logician in 1930 to the following report by one of his colleagues? In 1932? 

(a) "In S, I have proved S consistent.” 

(b) “In T, I have proved S consistent.” 

*§45. Application to the predicate calculus (Church, Turing). 

Thlorlm VII. There is no decision procedure for provability in the 
(pure) predicate calculus; briefly , the predicate calculus Pd is undecidable. 
(Church 1936a, Turing 1936-7.) 

Prook Theorem IV followed from Theorem III because the theory of 
the predicate (Ex)T(a, a, x) can be formalized in N to the following extent: 
to each natural number a, a formula C„ can be found effectively such that 
C„ is provable in N if and only if ( Ex)T(a , a, x ) ((a) and (b) in §43). 

Theorem VII will follow from Theorem 111 because the theory of 
(Ex)T(a, a, x) can similarly be formalized in simply the pure predicate 
calculus Pd (end § 39). Our argument that the theory of ( Ex)T(a , a, x) can 
be formalized in Pd will be in three parts, (a) On\y finitely many function 
symbols f„ . . . , I',, and (“nonlogicai”) axioms have to be added to the 
predicate calculus with the predicate symbol = to obtain a formal system 
S,. in which the theory of ( Ex)T(a,a , x) can be formalized to the extent 
required. ([ i) The system S A . can be transformed into an essentially equiva- 
lent system S with the symbolism of the pure predicate calculus and still 
only finitely many nonlogicai axioms, (y) The closed nonlogicai axioms 
of S can be regarded as assumption formulas for deductions in Pd, to 
which the deduction theorem can be applied. 

( y.) To outline the first part, we start with the fact that the representing 
function (§ 40) 

(0 if T(i, a, x), 

r(/, a, x) = 

I otherwise 

of the predicate T(i, a, x) can be defined by the last of a finite list of (prim- 
itive) recursive definitions, beginning with those of + and • (§§ 38, 41). The 
reader must take our word for this part of the detailed proof. 193 

Now we describe the system S A .. We start with the predicate calculus, 
using the symbolism of the system N of § 38 (i.e. we start with the system 

192 Solution in § 47 below. 

1,J A lair amount of technique such as is provided in IM Chapters IX, X and XIII 
is helpful in constructing this list of recursive definitions. 



§45 UNDECIDABILITY OF THE PREDICATE CALCULUS 261 

(20) of §39). To this we add function symbols to express the functions 
after + and • which are defined by the rest of the recursive definitions 
culminating in the definition of r. Say that altogether the (individual and) 
function symbols expressing 0, r are fi, . . . , f A . (so “f\” is a 

name for 0, “f a ” for etc.). As the axioms A)', . . . , A* u we add to the 
axioms of the predicate calculus, we take the last six particular 
axioms 16-21 of N, also the pairs of equations of the additional recursive 
definitions, and finally the open equality axioms for the function symbols 
f s , . . . , fi*. (§ 29). For f 3 (i.c. +) these arc the two formulas which were 
proved in N in § 38 Example 2 and Exercise 38.2 (but now they are to be 
axioms). 

In S k the equations expressing the values of the functions r 

will all be provable, as is illustrated for + by Exercise 38.4. Thus, when 
i, a, x are natural numbers such that T(i, a, x) holds, f*(i, a, .v)=0 will be 
provable in S, r So when (Ex)T(a, a, x) holds, we can in S*. prove the equa- 
tion f k (a , a , .r) = 0 for a suitable x, and thence by 3-introduction prove 
3xf A (a, a, x)=0; call this formula “D*”. Thus 

( 1 ) (Ex)T(a, a, x) -> { h D* in S A .}. 

We get an easy proof of the converse of (1) by applying the theory of 
valid consequence in the predicate calculus with functions (§§ 20, 28). For, 
similarly to (B) in § 38, if I- D* in S t „ then VA*, . . . , VA* t ¥ D*. Now take 
as the domain D the natural numbers {0, 1,2,.. and consider the assign- 
ment in D in which =, f,, f 2 , f 3 , f 4 , . . . , f fc are interpreted by the equality 
predicate =, the natural number 0, and the functions \ +, • r respec- 

tively. Under this assignment, VA^, . . . , VA* (< are all t, so (by 
VA*, .... VA*; ij; f D*) DJ is also t. But under this assignment, D£ being t 
means that ( Ex)T(a , a, x). Thus 

(2) { h D) in S A .} -> (Ex)T(a, a, x). 

This proof of (2) is not metamathcmatical (cf. § 37). A metamathematical 
proof can be given. 194 Combining (1) and (2), 

(3J { h D* in S A .} = ( Ex)T(a , a, x). 

(J3) There is a method by which, starting with S A ., we can replace succes- 
sively the function symbols f 7c , . . . , f 4 , f 3 , f 2 , f A expressing r, 0 

191 The metamathcmatical proof of (2), and the details missing in our outline of the 
argument, appear e.g. in Klecne IM Part IV, with certain results of Parts II and III 
(cf. p. 434 Remark 2), though with a different T(i, a, x) than the present one. (There 
the treatment is based on general recursiveness instead of on Turing computability; 
cf. ) 41 above.) 

Thus for Theorem VII we don’t need a nonclemcntary assumption, like the assump- 
tion of (b) for Theorem IV. 
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by respective predicate symbols F t , . . . , F 4 , F 3 , F 2 , F 4 expressing the repre- 
senting predicates of r, . . . , •, -f, ', 0 (cf. end § 38). We shall give the idea 
by discussing the step in which f 4 (i.e. •) is replaced, assuming that the 
function symbols f t ., . . . , f 5 have already been replaced by predicate symbols 
F A ., . . . , F 3 . (By giving just this step, we can illustrate the process, while 
keeping the notation simple.) 

So we suppose that, as a result of the previous replacement steps, we 
have a system S 4 in which the only function symbols are f 4 , f 3 , fi>, f, (i.e. 
•, +, ', 0) and which has only finitely many nonlogical axioms Aj, . . . , A^. 
Furthermore in S 4 , to each natural number a, there is a formula DjJ such 
that 

(3 4 ) { h D® in S 4 } = ( Ex)T(a , a , x). 

We seek a system S 3 with only the function symbols f 3 , fi>, f,, with only 
finitely many nonlogical axioms A®, . . . , A* la , and in which to each a 
there is a formula D* such that 

(3,) { h D® in S 3 } = (Ex)T(a, a, x). 

To obtain S 3 from S 4 we alter the symbolism by replacing the 2-place 
function symbol • by a 3-place predicate symbol which we shall also write • 
(i.e. we use • for both f 4 and F„). Thus in the formation rules, we omit the 
clause which says that whenever r and s are terms (r)-(s) is a term, but 
add a clause which says that whenever r, s and t are terms '(r, s, t) is a 
formula. For the interpretation, the formula -(r, s, t) of S 3 has the same 
meaning as (r)-(s)=t had in S 4 . That is, -(aj, ^expresses the representing 
predicate u-b = c of the product function o h (cf. end § 38). 

To get the axioms of S 3 from those of S 4 , we replace 20 and 21 by 

(A) •("* 0, 0), (B) 3r(-(«, (>, c ) & -(a, «+«)). 

We replace the two equality axioms for • in S 4 by 

(C) « = / 3 (•(", r, </) 3 '(fi, r, </)), (D) « = (•(*, a, d) 3 ■{{, 6, d)). 

We add the two axioms 

(E) // = ^3 (■(<■, d, a) 3 -(c, d, 0), 

(F) 3f(-(«, f, <■) & V4-(fl, d) 3 c=d)). 

Here (C)-(E) are the open equality axioms for the predicate symbol •> 
which express that •(«, b, cj is well-defined as a predicate; and (F) (abbrev- 
iated “3 !<••(*, /, <■)” in §29) expresses that, for given a and b, •( a , b, c ) 
is true lor one and only one e, i.e. that -(o, b, c) is the representing predicate 
of a function. Finally, we replace each other axiom of S 4 which has the 
function symbol • in it by the result of paraphrasing it to use the predicate 
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symbol •. Already (A) and (B) are such paraphrases of 20 and 21, and the 
method of paraphrasing is also illustrated at the end of § 38 for the func- 
tion symbol !. 

Now in the system S 3 which we have just constructed from S 4 , the same 
situation exists relative to the function ■ that exists in N of § 38 relative 
to functions like !. By the general theory cited in Footnote 156 §38, we 
can in S 3 carry out the reasoning about the function • via the paraphrases. 195 
So in this way, from (3 4 ) it can be inferred that (3 3 ) holds when D 9 is the 
result of paraphrasing to replace the function symbol • by the predicate 
symbol •. (In fact, D 1 will not contain •, so D® is D*. But D* -1 will differ 
from D£, D 1 from D“, and D® from D 1 .) 

The successive replacements (illustrated by the step from S 4 to S 3 ) lead 
to a system S 0 with no function symbols, but instead only predicate sym- 
bols, and with only finitely many axioms A 9 , . . . , A®„ o . In S 0 , to each a 
there is a formula D® such that 

(3„) {hD® in S fl } = (Ex)T(a,a,x). 

From S 0 we now pass to a system S having the notation of the pure 
predicate calculus Pd, by changing the predicate symbols = , F 1( . . . , F* 
of S„ to respective predicate letters P 0 , . . . , P* (used with the same respec- 
tive numbers of arguments). Let A 4 , . . . , A m (m — m 0 ) and D n result by 
this change in notation from A®, . . . , A^, o and D®; so Aj, . . . , A„, are 
the nonlogical axioms of S. A proof of D® in S 0 will clearly become a 
proof of D„ in S when the notation is thus changed (it will make no differ- 
ence to any step in the proof whether we are using the predicate symbols 
or the respective predicate letters). Thus 198 

(4) {b D® in S„} — * {b D„ in S}. 

The converse is not quite so simple, since a given proof of D„ in S might 
use some predicate letters other than P 0 , . . . , P*. (In S we have available 
infinitely many predicate letters with each number of arguments, but in 

S 0 only the k + 1 predicate symbols =, F 4 F*.) However a proof of 

D„ in S will remain one when the atoms in it formed using other predicate 
letters are changed to VxPi(x). The resulting proof of D a in S will become 
one of D® in S 0 when P 0 , . . . , P* are changed to =, F x , . . . , F*. Thus 196 

(5) { b D„ in S} -*- { b D® in S 0 }. 

1,5 The application of the results cited in Footnote 156 § 38 to replace a function 
symbol by a predicate symbol is discussed in general in Hilbert and Bernays 1934 pp. 
460-467 and IM pp. 417-419; and for the case of multiplication • (but not for exactly 
the system S,) in IM Example 1 1 p. 419. 

«* The implications (4) and (5) constitute simple applications of the proof-theoretic 
substitution rule cited in the second paragraph of Footnote 86 § 25. 
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Combining (4) and (5) with (3 (J ), 

(6) { h D a in S} = ( Ex)T{a , a, a:). 

(v) Now {h D„ in S} = {VAj, . . . , VA,„ I- D (1 in Pd} (similarly to (A) 
§ 38 p. 208) 3= ! h VA, Z) (VA, Z> . . . (VA,„ => DJ . . .) in Pd} (by Corollaries 
Theorems 1 1 ,,,, and 10, ,,,). Using this in (6), 

(7) { I- VA, D ( VA, => . . . (VA m 3 D„) . . .) in Pd] = (Ex)T(a, a, x). 

Now we complete the proof, as we planned, by applying Theorem III. 
Thus, suppose there were a decision procedure for provability in Pd. Then, 
given a , we could decide as follows whether (Ex)T(a, a, x) is true or false, 
contradicting Theorem III. From a, find the formula 
VA, O (VA, Z) . . . (VA„, DD,).. .). Apply the supposed decision proce- 
dure for provability in Pd to the question whether this formula is provable. 
By (7), according as the answer is “yes” or “no”, (Ex)T(a, a, x) is true 
or false. — 

An interesting new proof of Theorem VII (presupposing ideas given 
here in Chapter VI) is in Biichi 1962. 

The undecidability results made possible by the Church-Turing thesis 
(§41) arose first (like Theorem III) directly in connection with the new 
notions of /.-definability, general recursiveness and Turing computability, 
and next (like Theorems IV and VII) for decision problems for formal 
systems. 

Church on May 19, 1936, shortly after enunciating his thesis and 
obtaining his first results (Theorems III, IV and VII, particularly), wrote 
the author, “What I would really like to see done would be my results or 
yours used to prove the unsolvability of some mathematical problems of 
this order not on their face specially related to logic.” This hope was 
fulfilled, beginning in 1947 when Post and A. A. Markov (the younger) 
independently of each other showed the “word problem for semi-groups” 
to be unsolvable. 137 This led to the unsolvability of the “word problem for 
semi-groups with cancellation” in Turing 1950 (>vith Boone 1958), and 
thence to the unsolvability of the “word problem for groups” in a 143-page 
article by Novikov 19 5 5. 198 Simpler proofs of Novikov’s result have since 
been given from other directions by Boone in 1957 of 1954-7, and in 1959, 
and by Britton in 1958 with 1956-8, and in 1963, and (as a corollary of 
another theorem) by Higman in 1961. 199 In 1958 Markov established the 
unsolvability of the “homeomorphism problem for four-dimensional mani- 
folds” in topology. This is reworked and extended in Boone, Haken and 

A treatment is given in IM § 71. 

IW For the problem, see Delin 1912 p. 117. 

188 Britton's 1963 proofhas been worked into a textbook: Rotnian 1965 Chapter 12. 



§46 DEGREES OF UNSOLVABILITY, HIERARCHIES 265 

Poenaru 1968. Other major publications in this area include Rabin 1958, 
Clapham 1964, Shepherdson 1965, Boone 1966, 1966a and 1968 (with full 
bibliography). In the papers cited in bold face, the inquiry is refined to 
consider not just solvability vs. unsolvability but also “degrees of unsolv- 
ability” (§ 46). 

Unsolvability results have begun to appear in real-variable analysis: 
Scarpellini 1963; Richardson 1966 abstract (which treats a variant of the 
integrability problem of Exercise 40.1 (k)). 

Undecidability results have also been obtained in connection with ! 
grammatical pro blems for languages for use with computing machines and ' 
finite automat a: Rabin and Scott 1959; Bar-Hillel, Perles and Shamir 1961. . 

* § 46. Degrees of unsolvability (Post), hierarchies (Kleene, 
Mostowski). We can summarize our proofs of Theorems IV and VII as 
follows. First, in Theorem III, we established the undecidability of the 
predicate (Ex)T(a, a, x). Then we “reduced” the decision problem ( P ) for 
this predicate to the decision problem (0 for provability in N or in the 
predicate calculus Pd. 

To say this in more detail, take the case of Pd (Theorem VII). We showed 
that, if we had a way to answer any question of the class (Q) “Is a given 
formula E provable in Pd ?”, then we could answer any question of the class 
(P)“Does a given natural number a have the property that {Ex)T(a, a, x)T\ 
But in Theorem III we had shown from the Turing machine concept with 
the Church-Turing thesis that there can be no algorithm to answer all 
questions of the class ( P ). Therefore there can be none to answer all 
questions of the class (0. 

We reduced ( P ) to (0, because ( P ) is the class of questions we could 
show directly to be undecidable. However, it is of some interest to see 
that, inversely, (0 can be reduced to ( P ). This is included in the following 
proposition (and similarly with N in place of Pd). 

(A) To each predicate of the form (Ex ^ . . . (Ex m )R(a „ . . . , a n , x u . . . , x m ) 
where R(a u . . . , a n , x u . . . , x m ) is a given decidable predicate (m, n > 0), 
there is a computable function 6(a t , . . . , a n ) such that 

(Eltj) . . . (Ex m )R(a 1 , . . . , a ni Xi, . . . , # m ) — 

( Ex)T(6(a u a n ), 0(a „ . . . , a„), x). 

Similarly for n = 0, with 6(a u ...,a n ) becoming simply a natural number h. 

(Proof follows.) Thus, any question of the class “Is 

(Ex,) . . . (Ex m )R(a lt a n , x 1 xj true?” is reduced to answering 

a corresponding question of the class “Is ( Ex)T(b , b, x) true?”, namely the 
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question with b = 0(a u ...,«„) (where, since 0 is computable, from any 
, a n we can effectively find the corresponding b). 2m 

To prove (A), take any fixed a u . . . , a n , and consider the following 
intuitive computation procedure. First, suppose all w-tuples of natural 
numbers (x u . . . , x m ) have been enumerated, i.e. brought into an infinite 
list (§ 32). Now, using a Turing machine (supposed given) which decides 
the predicate /?(<*!, . . . , a n , x lt . . . , x,J, test the m-tuples (aq, . . . , xj in 
the order listed, searching for one which makes R(a ly . . . , a n , x lt . . . , x m ) 
true, and write 0 if such an w-tuple is found, but continue the search ad 
infinitum otherwise. 

Now we claim, and the reader will have to take our word for the details, 
that a Turing machine can be constructed which, applied to any number b 
as argument, carries out the above process and writes 0 when an w-tuple 
(a:,, . . . , x m ) is found for which R(a u a • • • » x m ) ' s true > but 

otherwise never computes a value. What machine we construct depends 
on the numbers a u . . . , a H we started with. We further claim that these 
various machines can be constructed so that their indices can be given by 
a Turing computable function 0(a lt . . . , a n ). Thus, by what the machines 
do and the definition of the predicate T(i, a, x) in § 42, 

(Ex,) . . . (Ex m )R(a ly a n , x u . . . , xj = (Ex)T(0(a u a n ), b, x) 

for all a a,„ b. Taking b = 0(a lt . . . , a,) in this gives the equivalence 

in (A). 

To apply (A) to see that the decision problem (Q) for provability in the 
predicate calculus Pd (or in N) can be reduced to that ( P ) for (Ex)T(a, a, x), 
we may first give Godel numbers to the formulas in the predicate calculus 
in an effective way (cf. § 43 Discussion). Now “A is provable” can be 
expressed as.(Fx)Pr(a, x ) where a is the Godel number of A, and Pr(a, x) 
is the decidable predicate which says that x is the Godel number of a proof 
of the formula with the GQdel number a. Now (A) applies with Pr as the 
R (with n = in = 1). 

Summarizing, by (A) with the proofs of Theorems IV and VII, each of 
our three examples of undecidable predicates (or unsolvable decision 
problems) is reducible to each of the others. 

In Cantor’s set theory, when we began comparing infinite sets under 
one-to-one 'correspondences, we discovered that not all infinite sets are 

2U0 Post in 1944 gave the first result of this sort, using a different predicate than 
(Ei)T(b, b, .<). (The earlier sections of Post 1944 are less technical than most papers 
written in this area.) The result for a predicate in the theory of general recursive functions 
analogous to the present (Ex)T(b,b,x) appeared in IM p. 343. Slightly stretching 
Post’s terminology, (Es)T(b,b,x) can be called a complete predicate for the class of 
predicates (hr,) ■ . . ( E-c m )R(a x m ). 



§46 DEGREES OF UNSOLVABILITY, HIERARCHIES 267 

equally numerous; indeed, we found a hierarchy of increasing cardinal 
numbers. Analogously, the question arises now whether all undecidable 
number-theoretic predicates are equally undecidable, in the sense that the 
decision problem for any one of them is reducible to the decision problem 
for any other. As in set theory, the answer is negative; indeed, some predi- 
cates are “more undecidable”, or (have decision problems) of ’^ higher 
degree of unsolvability”, than others. 

So far we have only-considered some particular pairs of undecidable 
predicates which are equally undecidable (the decision problem for either 
one being reducible in a simple way to that for the other). Now we need a 
more general notion of when one predicate (or its decision problem) is 
reducible to another. This requires an extension of Church’s thesis or the 
Church-Turing thesis to decidability or computability relative to a given 
predicate 0(6). 

The concept that we use here is due to Turing 1939, and its use to define 
“degree (of unsolvability)” to Post 1948 abstract (with some anticipation 
in 1944). 2 " 1 Consider a “0-machine”, which is like an ordinary or “abso- 
lute” Turing machine, except that it has access to a second (actually) 
infinite tape (a “0-tape”) on which are printed the answers to all the 
questions (for b~ 0, 1,2, . . .) whether 0(6) is true or false. (Turing 
instead spoke of a machine having access to an “oracle” that would answer 
any such question the machine asks it about 0(6).) If a suitable 0-machine 
can answer all the questions (for a = 0, I, 2, . . .) whether P(a) is true or 
false, we say (the decision problem for) P(a) is ( Turing ) reducible to (the 
decision problem for) 0(6). If P(a) is reducible to 0(6) and vice versa, we 
say P(ci) and 0(6) (or their decision problems) are of the same degree ( of 
unsolvability). If P(a) is reducible to 0(6) but not vice versa, we say P(a) 
is of lower degree than 0(6) or 0(6) is of higher degree than P(a). Hence, 
if P(a) is reducible to 0(6), P(a) is of the same or lower degree than 
0(6). 

Here lower, equal or higher degree between predicates P(a) and 0(6) 
corresponds to less, equally or more numerous between sets M and N. To 
put the theory in better form, we should define degrees themselves and 
not just use the word “degree” to express the above relationships between 
predicates P(a) and 0(6). Here we can use the same method as in defining 
“cardinal number” by the Frege-Russell method in §34. We begin by 
verifying (Exercise 46.1) that the above “equal-degree” relation “P(o) is 
reducible to 0(6) and vice versa” is an equivalence relation, i.e. it is 
reflexive, symmetric and transitive (like the “equally-numerous” relation 
“/V/ can be put into 1-1 correspondence with N'\ § 34). So we can define 

801 In IM pp. 314-315 the treatment is based equivalently on the theory of general 
recursive functions (relativized by Kleene 1943), instead of on Turing computability. 
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the device of any number-theoretic predicate P(a) to be the equivalence 
class to which P(a ) belongs under this equivalence relation (§ 30). Then to 
justify the above definition of when one predicate P{a) is of lower degree 
than another 0(6), which now becomes “the degree of P{a) is < the degree 
of 0(6)’", we need to verify (Exercise 46.2) that the result is independent of 
what particular predicates P(a) and 0(6 ) are used from their respective 
equivalence classes. 

The relation < between degrees is irreflexive and transitive (Exercise 
46.3). 

To keep the notation simple, we have spoken just now only of one-place 
number-theoretic predicates P(a) and 0(6). But the reducibility notion 
applies likewise to number-theoretic predicates or functions of any num- 
bers > 1 of arguments; and the equivalence classes can be taken in this 
larger totality. 

All the questions about particular values of a decidable predicate P(a) 
can be answered by a suitable absolute Turing machine. So a suitable 0- 
maehinc can answer all those questions, without looking at its 0-tape. 
Hence each decidable predicate P(a) is of degree < the degree of each pred- 
icate Q(b). The degree of any decidable predicate we write 0 (since each 
two decidable predicates are reducible each to the other, the decidable 
predicates do constitute a degree); this we have just seen is the lowest 
degree of unsolvability (“solvability”). 

By Theorem III, the predicate ( Ex)T{a , a,x) cannot be decided by an 
absolute Turing machine. So (Ex)T(a, a, x) cannot be decided by a 0- 
machine when 0(6) is a decidable predicate. For, the values of Q{b) on 
the 0-tape would give the 0-machine only unnecessary help; a suitable 
absolute machine could manufacture those values for itself. 

Hence ( Ex)T(a,a, x ) is of higher degree than the decidable predi cates. 
We write 0' (= 1) for the degree of ( Ex)T(a , a , x). Thus 0' > 0. 

Now it is easy to see how to proceed to higher degrees than O'. Though 
we have not spelled out exactly how the 0-tape of a.0-machine is used, 
it should be fairly obvious that we can do so, and then construct a predicate 
T Q {i,a,x) (relative to a given predicate 0(6)) which plays the role for 
0-machines which T(i, a, x) plays for absolute machines. Then (similarly 
to Theorem l (A)) T Q (i, a , x) is decidable by a 0-machine, and <pf(a) as 
a partial function of / and a is computable by a 0-machine. 

By using a 0-machine instead of an absolute machine in the proof of (A), 
we can establish : 

(B) For am' predicate Q, (A) holds when R(a t , . . . ,a„,x ,,..., x m ) is 
allowed to be any predicate decidable by a Q-machine, and T(a, a, x) is 
replaced by T (f (a, «, .r). 
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Similarly, paralleling with (2-machines the proofs of Theorems II and 
III, ( Ex)T<\a , a, x) is not decidable by a (2-machine. But since 
Q(a) = ( Ex)(Q(a ) & x=x), by (B) (2 is reducible to ( Ex)T\a , a , x). Thus 
(Ex)T%i, a , x) is of higher degree than Q. We can state this result in the 
following form. 

(C) If Q(b) is a predicate of degree d, then ( Ex)T Q (a , a, x) is of degree 
d' > d. 

Here by using the notation d' we include that the degree of ( Ex)T Q (a , a , x) 
depends only on the degree d of g; and furthermore that, when Q is 
decidable, the degree of (Ex)T Q (a, a, x) is the same as the degree of 
(Ex)T(a, a, x), which we originally designated as O'. These additional facts 
are easily proved using (B) and (A) (Exercise 46.4). 

Using (C) repeatedly, we get a succession of increasing degrees 
0 < 0' < 0" < . . . < 0 ( '‘> < . . 

It is easy to see that: 

(D) If P fa), P,(a), Pfa), ... are predicates of increasing degrees, and 
P{n, a) = P n (a), then P(n, a) as a 2-p/ace pred icate is of higher degree than 
any of Pfa), Pfa), Pfa ), .... 

(C) and (D) play a role in generating increasing degrees like (C) and (D) 
in § 34 in generating increasing cardinal numbers. This method of generat- 
ing number-theoretic predicates of increasing degrees is due to Davis 1950, 
and independently to Kleene and to Post. 

H ierarchies of number-th eoretic pred icates were first discovered from 
another point ~of view by Kleene in 1943 ( 1940 abstract), and independently 
(with a somewhat different method) by Mostowski in 1947. Using Kleene’s 
approach, we first establish: 

(E) To each decidable predicate R(a, x), there is a number f such that 

( Ex)R{a , x) ~ (Ex)T( f a, x). 

Similarly, to each decidable predicate . . . , a n , x), there is a number f 
such that 

(Ex)R{a l , ...,a n ,x)== ( Ex)T(f a x a,„ x) 

where T(i, a lt . . . , a n , x) plays the same role for the Turing machine com- 
pulation of n- place functions as T(i, a, x)for 1 - place functions. ( Enumeration 
theorem. ) 2U2 

50i (7;.i)'/ (0, a, .)■), (/•>) T( I , a, jc), (Ej )T(2, a, x ), ... is an enumeration with repetitions 
>f all predicates of the form (Er)R(a, <9 with R(u,x ) decidable (the “enumerating 
predicate” (Ex)TU. a, x) being of the same form except for the necessary additional 
■ ariable i). 
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To prove (E) for the case of one variable a, we need only let 9Jt, 
be a Turing machine which, applied to a, searches for an x such that 
R(a, a:) and writes 0 if one is found, but computes no value other- 
wise. 

(F0 The predicate ( x)T(a , a, x) is not expressible in the form ( Ex)R(a , x) 
with R(a, x) decidable. The predicate ( Ex)T(a , a, x) is not expressible in 
tlw form (x)R(a, x) with R(a, x) decidable. A fortiori , (x)T(a, a , x) and 
( Ex)T(a , a, x) are not decidable. 

For, suppose to the contrary that ( x)T{a , a, x) = ( Ex)R(a , x) for some 
decidable predicate R(a, x). Then 

(7) TV, a, x) = {EfiRfi, x) s (Ex)T(f a, x) (usiiig (E)) = (x)T(f a, x). 
Substituting/ for a, (x)T(ff x) s (x)T(ff x), which is absurd 
( 1 — 1 ( — iP ~ P) in the propositional calculus). 

Similarly, suppose that ( Ex)T(a,a , x) = (x)R(a, x) for some decidable 
R(a, x). Then (Ex)T(a, a,x) = ( x)R(a , x) = {Ex)R(a, x) = (Ex)T(f a, x) 
(using (E) for R(a, x) as its R{a, x)). Substituting/ for a, 

( Ex)T(f,f x) = (Ex)T(f f x), which is absurd. 

To infer that ( Ex)T(a , a, x) is not decidable, suppose that 
( Ex)T(a , a, a,-) = R(a) with R{a ) decidable. Then 

(Ex)T(a, a, x) = (a ;)(R(a) &x=x), which contradicts the second part of (Fj), 
since R{a) & x=x is decidable. Similarly or by ( x)T(a , a, x) = ( Ex)T(a , a, x), 
(x)T(a, a, x) is not decidable. 

(F.,) (Ex)(y)T(a, a, x, y) is not expressible in the form (x)(Ey)R(a, x, y) 
with ~R(a, x, y) decidable. ( x){Ey)T(a , a, x, y) is not expressible in the form 
(Ex)(y)R(a, x, y) with R(a,x, y) decidable. A fortiori, (Ex){y)T(a, a, x, y) 
and (x)(Ey)T(a, a, x, y) are not expressible using one or zero quantifiers 
applied to a decidable predicate. 

This is proved similarly to (Fi). Likewise we have propositions (F 3 ), 
(F 4 ), (Fj), . . . concerning the predicate forms with 3, 4, 5- . . . quantifiers 
(alternating between existential and universal) applied to decidable 
predicates.'-"* 

(F t ), (F.,), (F 3 ), ... are summarized in the hierarchy theorem of Kleene 
1943 (1940 abstract): 


Since RUi, ■••) is decidable, R(a, x) = ka. x/holds intuitionistically. Also 
(Er)T(f, a,s) = U)T(f,a,x) and (x)R(a, x) = (£>)R(a, x) are (informal) applications 
of *82a, which holds intuitionistically. Thus our proof of (F,) is good using only in- 
tuitionistic loeic. Classical logic is required for (F 2 ), (F 3 ), (F 4 ), . . .. 
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i 46 


(F) Consider the predicate forms 

(Ex)R(a, x) (x)(Ey)R(a, x, y) 
(x)R(a, x) {Ex)(y)R(a, x, y) 


m 


( Ex)(y)(Ez)R(a,x,y,z ) .. 

{x)(Ey){z)R(a, x,y,z) 


where in each form R is a decidable predicate. To each form after the first, 
there is a predicate expressible in that form but not in the form dual to it 
(i.e. arising from it by interchanging existential and universal quantifiers) 
nor in any of the forms with fewer quantifiers. 

Using a theorem of Post 1948 abstract (1M Theorem XI p. 293), it can 
be shown that the degrees of a decidable predicate and of the predicates 
(Ex)T(a, a, x), (x)(Ey)T(a, a, x, y), ( Ex)(y)(Ez)T(a , a, x, y, z), . . . are indeed 
0, O', 0", O ', . . . , the same as those of the predicates obtained by starting 
with a decidable predicate and using (C) repeatedly. 

Consider a formal system N (like that of § 38) in which, to each a, a 
formula can be found which expresses ( x)T(a , a, x); indeed since 
{ r)T(a , a, *) = (Ex)T(a, a, x), this formula can be iQ for the C„ of § 43. 
Say the Godel number of ~iC a is a(a) where a is a computable function. 
Using the predicate Pr (a,x) for N mentioned early in this section, 
{F — iC„ in N} = (Er)Pr(a(a), x). Here Pr(a(a), x) is a decidable predicate 
of a and x; write it simply R{a, *). Thus 

(e) { F -iC a in N} = ( Ex)R(a , x). 


Now it is immediate from the first part of (F t ) that N cannot be both 
correct and complete, so that -iC„ is provable in N if and only if “iC 0 is 
true, i.e. so that 


(J) { I — iC„ in N) = (x)T(a, a, x). 

For, combining (*) with (e) would give ( x)T{a , a, x) = (Ex)R(a, x), contra- 
dicting (F t ). Thus (*) cannot hold for all a. Assuming that N is correct, 

(c) { I — iC„ in N} — > (x)T(a, a, x), 


which is one of the implications in (J). So the other 


(*) 


(x)T(a, a,x)-*{ h ~iC„ in N} 


cannot hold for all a. Since (x)T(a, a, x) = (Ex)T(a, a, x), this (c) and (*) 
are the same as in §43, and by the correctness of N we again have (b). 
Thus (continuing as in the first proof of Theorem V), Godcl’s incomplete- 
ness theorem, for the system N of § 38 or any system N that is correct and 
meets the very general structural condition expressed by (e) holding for 
some decidable R, is implicit in the first part of (Fj). 
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This uses classical logic to infer the existence of a value p of a for which 
(*) is false. (Hierarchy theory is largely classical.) If (instead of assuming 
(*) for reductio ad absurdum) we apply (E) (as we did in proving (F,)), 
we get a number /such that 

(0 (Ex)R(f,x) = (Ex)T(ffx). 

Now 1 heorem V (i)-(iii) with /as the p follow intuitionistically from (b), (c), 

(e),(f). 

^ Church’s theorem (Theorem III) is the third part of (F x ). 

In brief, Churc h’s and Godel’s theore ms correspond to the two forms 
, R(a) and (Ex)R(a, x) in (F). From this point of view (emphasized in Kleene 
1943), the hierarchy theorem (F) can be regarded as a generalization of 
Church’s and Godel’s theorems. Under the other construction using (C), 
the hierarchy results by the iteration of Church’s theorem in a relativized 
version.'- 1 ” 

Post in 1944 raised the question whether any predicate of the form 
(E. r)R(a,x ) with R decidable has a degree (strictly) between 0 and O'. In 
1954, Kleene and Post showed that there exist predicates having degrees 
between 0 and O'; but their method did not show whether any of those 
predicates are of the form (Ex)R(a, x) with R decidable. In I ^fLEriedberg 
(U.S.A.) then pnly_20,yjyosj/d and Jvlucnik (U.S.S.R.) of similar age, 
independently of each other, refined the KdeemCPost construction to show 
that there is a predicate of the form (Ex)R(a, x) with R decidable whose 
degree is between 0 and O', solving Post’s 1944 problem.- 05 
There are incomparable degrees (i.e. degrees a and b such that neither 
a < 1) nor a = b nor a > b). The totality of all the degrees possessed by 
number-theoretic predicates, including degrees jumped over or bypassed 
in the hierarchies described above, has a very complicated structure. 200 

Exi.kcisi'S. 46.1. Show that the relation “P(a)' is reducible to Q(b) 
and vice versa” is reflexive, symmetric and transitive. 

46.2. Show that, if P^a) and Q x {b) have the same respective degrees as 
P(a) and 0(a), then {P(a) is reducible to Q(b) but not vice versa} = 
{Pi(a) is reducible to Qi(b) but not vice versa}. 

46.3. Prove that for any degrees a, b, c: (a) Not a < a. (b) If a < b and 
b < c, then a < c. 

2111 Lor a little more of an indication of hierarchy theory, cf. Kleene 1958 §2. 
Moslowski 1954 and Kleene 1955b give fuller but more technical expositions of results 
to that time. 

Lriedberg 1956 article concerning, 1957 (abstract 1956); Mucnik 1956, 1958. 

The first results to this effect were in Kleene and Post 1954. Much work has been 
done since by Spector, Lacontbe, Shoenfield, Sacks and others. Cf. Sacks 1963. 
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46.4. Show that: (a) If Q x and Q, z are of the same degree, so are 
( Ex)T Q <(a , a, x) and ( Ex)T Q *-{a , a, x). (b) If Q is of degree 0, then 
(Ex)T Q {a, a, x) is of the same degree as (Ex)T(a, a, x). 

46.5. Prove a statement like (E) with universal instead of existential 
quantifiers. 

46.6. Show that for any decidable R, 

(Jj) - ■ ■ (xJR(a u „ . . . , *,„) is of degree < O'. 

46.7. Establish (D) and (F 2 ). 

46.8*. Show that each of the predicates expressible in the symbolism 
of N § 38 under the usual interpretation (which were called “arithmetical” 
by Godel 1931) is expressible in one of the forms of (F). 207 

46.9*. Assume the fact that every decidable predicate is expressible in 
the symbolism of N § 38. 208 Thence establish: 

(a) The converse of Exercise 46.8. 

(b) For any fixed effective Godel numbering of the formulas of N (end 
§ 43), the predicate “ a is the Godel number of a true closed formula of N” 
is not expressible in the symbolism of N. 

* § 47. Undecidability and incompleteness using only simple 
consistency (Rosser). For Theorems IV-VI, we made two non- 
elementary assumptions (b) and (c) about the system N, because they make 
the approach to the theorems easy, and because we can hardly doubt that 
they hold for the N of § 38 or any system we would want to use instead. 

However, to formulate Theorems IV-VI as elementary metamathemati- 
cal theorems, (b) and (c), or something to the same effect, should be 
included in the hypotheses. This we have done in the second paragraph of 
each theorem. 

Theorem VI followed from Theorem V by noting that (c) can be replaced 
by the simple consistency and (a). 

There remains (b), which we used in proving Theorems IV and V (iii). 
Consider any N (like that of § 38) in which C„ is of the form 3xT(a, a, x) 
where T(a, a, x) -* { h T(a, a, x) in N}, whence (a), and 

(g) T(a, a, x) -*■ { b ->T(a, a, x) in N}. 

We can then replace (b) in proving Theorem V (iii) by the hypothesis that 
N is “re-consistent” in the following sense, due to Godel 1931. A system 
S whose notation includes the numerals (§ 43) is o>- consistent if in it, for 
no variable x and formula A(x), do all of h A(0), f A(l), h A(2), . . . and 

Solution in IM p. 285 Theorem VII (d), using general recursiveness instead of 
Turing computability. 

i0 “ Included in IM p. 285 Theorem VII (b). Using this to generalize Exercise 46.8 to 
allow symbols for any decidable predicates, we obtain the proposition which led Kleene 
in 1940 abstract to consider exactly the list of predicate forms in (F). 
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I -iVxA(x) hold (otherwise, S is 10 -inconsistent). (An w-consistent system 
S is simply consistent, as follows by writing any formula E as A(x) where 
x is a variable not occurring in E, so that A(0), A(l), A(2), ... are all E and 
~iVxA(x) is equivalent to - iE by *75.) To prove (iii) from the ^-consistency 
(with (a), (d) and (g)), suppose I- C„, i.e. I- 3xT (p,p, x), whence 
I- -i-)3xT(/>, p, x), whence (using *82a) b -nVx“iT(/), p, x). But by (i) 
(already proved in §44 from (a), (d) and the simple consistency), 

( Ex)T(p,p,.r ), whence (x)T(p,p,x), whence by (g) (*){ I- ~H(p,p, .v)}, i.e. 

b -IT (p,p, 0), 1- “iT (p,p, 1), H ~iT(p, p, 2), . . .. These with 

I- - iVx - iT(p, p, x) above contradict the hypothesis of w-consistency. aotl 

/' Clearly, an <udnconsistent system violates the' interpretation of x as 
/ ranging over the natural mfmbers, expressed by the numerals 0, 1,2,.... 

It is <i consequence of Theorem V (ii) that there arc simply consistent but 
< o-incon.sistent systems of number-theory. (In substance, this is Exercise 
44.3, which we do now.) For, let N be the number-theoretic system of § 38, 
whose consistency we assume (on the basis of the interpretation or of 
Gentzen’s proof). Let M come from N by adding C„ as an axiom. Then, 
by arguments just given (using (g) and Theorem V (i) for N), M is co- 
inconsistent. But M is simply consistent; for, if b E and I — iE in M, then 
C„ b E and C„ b ~iE in N, whence by “i-introd., b in N, con- 
tradicting Theorem V (ii). 

Thus not only does Godel’s second theorem show that simple consistency 
is hard to prove for number theory, but also his lirst theorem shows that 
simple consistency is not the only consistency property that we will ordi- 
narily wish our formal systems for number theory to have. The proofs by 

" U ' J Using (g), co-consistency implies 

(b') (tC. in N} — (Ex)T(a, a, x), 

and hence classically (b). For, assume h C„ and {Ex)T(a, a, r). Thence (x)T{a, a,x) 
whence by (g): (a) (,r)( I- — lT(a, b, *)}. But also (as for a=p in the text): 
(n) (• — |Vx— iT(«, a, x). But co-consistency says (a) and (b) can't both hold; so by 

reductio ad absurdum (Ex)T(u, a, x). 

The proof of Theorem IV in §43 can be reworked to proceed intuitionistically from 
(b ) instead of (b), thus. Suppose there were a decision procedure for provability in 
iN. We could use it to construct a machine (IJt„ which, applied to a, attempts (successfully 
if ( C.r)T(a , .<;)) to compute (p a (u) + \ if I- C„ (Case 1), and writes 0 if not L C„ (Case 
2). Then as for Theorem II we can deduce a contradiction from (Ex)T(p, p, x) (after 
using (a) to infer i C„), and also from ( Ex)T(p,p , x) (after using (b') to infer not b C„). 
So by informal use of weak "l-elim. § 1 1 : (c) (Ex)T(p, p, x) 0#0, 

(l>) (Ef)T(p, p, .<■)-*• 0^0. From (c ) by contraposition twice ( * 1 3, *12 in §24), 
(e) (Er)T(p,p. ’■) -> 0^0. Now wc get a contradiction by cases (as above, for a = p). In 
Case 1, by (b ) ( Ec)T(p,p , x), whence by (e) 0^0, contradicting 0=0. In Case 2, by (a) 
and contraposition ( Ev)T(p,p , j - ), whence by (d) 0^0 
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Gcntzen etc. for N § 38 do actually establish w-consistency. At the same 
time, the problem of interpretation, mentioned at the end of § 44, is 
emphasized, because we have no reason to stop at securing only simple 
and ^-consistency. 

Meanwhile, in 1936 Rosser found another method of proof of Theorem 
IV, and of Theorem V with a different number q in place of p, in which 
all the results are obtained with simple consistency as the only nonelemen- 
tary hypothesis. We shall obtain these results now as corollaries of a more 
general theorem (Theorem VIII). 

We„ call _ a non-empty set or class S of natural numbers cotnpiilab/v 
' enumerable or recursively enumerable if there is a computable function ip 
such that <p(0), </j(1), q(2), ... is an enumeration (possibly with repetitions) 
of S. 210 Two sets S 0 and S, are disjoint if they have no common elements, 
1 or in symbols if S a r> 5, = 0 (cf. § 26). 

Theorem VIII. There are two ( non-empty ) disjoint recursively enumer- 
able sets C u and C 1 with the following property. Given any two disjoint 
recursively enumerable sets D a and D l including C° and C 1 respectively , 
i.e. any two recursively enumerable sets D° and D x such that 

(1) D a D l — 0, 

(2°) C° £ D\ (2 1 ) C l S D\ 

a number f can be found which belongs to neither D° nor £>', i.e. such that 
(3 °)/£D«, (i x )f$D\ 

(A symmetric form of Godel’s theorem, Kleene 1950.) 211 

Proof, using an idea of D. Lacombe (reported by Rabin 1958a Footnote 
6). With < pja) as in § 42 and “d” meaning “the a' s such that” (§ 26), let 

C° = d{q> a (a) is defined and i p u (a) = 0}, 

C l = a{tp„(a) is defined and yfa) 0}. 

Clearly C° and C 1 are disjoint (and non-empty). 

210 Such sets were first considered in Kleene 1936, using the general recursive func- 
tions (afterwards proved equivalent to the Turing computable functions; cf. § 41). The 
term “recursively enumerable” has become the standard one in the literature (Rosser 
1936, Post 1944, Smullyan 1959, etc.). Cf. IM pp. 306-307. Also the empty set is often 
taken to be “recursively enumerable” (Post 1944). 

211 The sense in which this is a form of Godcl’s incompleteness theorem is elaborated 
in IM § 61, which uses the original proof (Kleene 1950). 

The sets C" and C 1 are disjoint recursively enumerable sets which are “recursively 
inseparable” in the following sense: for no general recursive set D, C° c p & C* c D 
(where /> = the complement of D). 
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Suppose given recursively enumerable sets £>° and D l satisfying (1), (2°) 
and (2 1 ). Let a Turing machine 9)1, be constructed (incorporating machines 
which compute functions cp° and / enumerating D u and D 1 respectively) 
which carries out the following operation. Applied to a , 9JI, searches 
through the enumerations <p° and <p l of D 0 and D l respectively, looking 
for a. (4)1, works alternatively, some on and some on <p l , so that if the 
search is not stopped, it will eventually reach arbitrarily far out in each 
enumeration.) If 4W, finds a in the enumeration (p° of D°, it thereupon 
writes 1 and stops. If 9M, finds a in the enumeration q> 1 of D', it thereupon 
writes 0 and stops. If neither of these events occurs, 4H, continues the 
search ad infinitum, and so computes no value. 

To establish (3°), assume that f e D°. Then by (1 ),/ $ D 1 . So /is in the 
enumeration y° of D°, but not in the enumeration <p l of D 1 . So 9JI, applied 
to / will find fin the enumeration q>° (it cannot be prevented from finding 
/there by first finding/ in the enumeration rp l ). So <P/(f) (the value com- 
puted by S .U{ 7 applied to /as argument) is defined, and <p,(f) = 1 by the 
way DJI/ operates. Hence by the definition of C\ f 6 C 1 . Hence by (2 1 ) 
/ (.- l)\ contradicting/ £ D l (above). By reductio ad absurdum ,/ $ £>°; i.e. 
(3°) holds. 

The proof of (3‘) is symmetric to that just given for (3°). — 

A formal system S' is an extension of a formal system S, and S is a 
subsystem of S', if each formula of S is a formula of S' and each provable 
formula of S is a provable formula of S'. (Any formal system S is the 

“improper” extension of itself.) 

Corollary 1 . Assume the system N of § 38 to be simply consistent. To 
any simply consistent extension N' of N (including N itself), there is a dosed 
formula CJ of N such that : (i) — iC” is true, (ii) not I — iC" in N', (iii) not 
V q in N'. 

More generally, this applies to any simply consistent formal system N in 
which, to each a, there can be found effectively formulas C® and Cj ( expressing 
a €■ C° and a c C 1 respectively) such that (a°), (a 1 ), (b°), (b 1 ) below hold. 

Proof. The same methods which enable us to find in N § 38 a formula 
C,, to express (Ex)T(a, a, x) (beginning § 43) enable us now to find formulas 
C“ and C* expressing a e C° and a e C 1 respectively. Indeed, let 

U(i, a, x) = {Machine 3J applied to «, at Moment x scans a square 

next to the right of a blank square} 

(cf. §41). A formula U(i, a, x) expressing U(i, a, x) can be found, as before 
we found T(i, a, x) to express T(i, a, x). Let 

C" be 3x[T(a, a, x) & U (a, a, x)], C* be 3x[T(a, a, x) & "iU(a, a, x)]. 
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The same considerations which before gave (a) now give 

(a 0 ) a e C® -► { b C® in N}, (a 1 ) a e C 1 -* (b C 1 in N}, 

as can be proved in detail. Also, corresponding to the obvious disjointness 
of C" and C l , it can be shown that 

(b°) a e C«-> {b - 1 C 1 in N}, (b 1 ) a e C 1 -* {b ~iC® in N}. 

Now let N' be any simply consistent extension of N. Let 

D° = d{ b C“ in N'}, D' = d{ b -v C® in N'}. 

Then (1) of the theorem is satisfied, by the simple consistency of N'. Also 
(2°) holds; for, if a e C°, then by (a 0 ) b C® in N, and hence (since N' is 
an extension of N) b C® in N', i.e. a e Z>°. Similarly, using (b 1 ), (2 1 ) 
holds. 

Because of the nature of any formal system, here N', Turing machines 
4)1° and iOt 1 can be found which compute <p°(n) and </>'(»), where cp°{ri) is 
the a of the wth proof (in some enumeration of the proofs in N') which is 
a proof of C“ for some a, and <p\n) likewise for “iC®. Thus D° and D 1 
are recursively enumerable. 

So all the hypotheses of the theorem are satisfied. Hence there is a num- 
ber f satisfying (3°) and (3 1 ). Then by the definitions of D° and D 1 , (iii) and 
(ii) of the corollary hold. By (3°) and (2®),/ <£ C“; so (i) also holds. 212 — 
We call a formal system S essentially undecidable (after Tarski 1949 
abstract), if S is simply consistent, and every simply consistent extension 
of S (including S itself) is undecidable. 

Corollary 2. Assuming the system N of § 38 to be simply consistent, 
it is essentially undecidable. 

More generally, any formal system N satisfying the second paragraph of 
Corollary 1 is essentially undecidable . 

Proof. Assume N simply consistent, and let N' be any simply con- 
sistent extension of N. Now we take 

D® = a{ b C® in N'}, D 1 = d{not b C® in N'}. 

Now (1) is immediate. As before (Corollary 1), (2°) holds by (a®). Also 
(2 1 ) holds; for, if a e C\ then by (b l ) b mC® in N and hence in N', so by 

212 Symmetrically, (i)— (iii) hold with C) replaced by C) for a different choice of/. 
At least for the N of § 38, by using (A) in § 46 to write a e C° or a e C 1 in the form 
(Ex)T(O(a), 0(a), x), we have (i)-(iii) as they read in Theorem V but for p replaced by 
q = 0(f) with cither of the present /’ s. 
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the simple consistency of N' not h C“ in N', i.e. a 6 D l . As before, D° is 
recursively enumerable. 

Now assume that there is a decision procedure for provability in N', 
i.e. there is a machine that decides, for any formula A, whether or not 
I- A in N'. Using this machine, we could find a machine to compute a 
function < 7 1 which enumerates the present /)’, so D 1 would also be recur- 
sively enumerable. So by the theorem there would be a number/such that 
J\[_ /)" and f (j D'. With the present D u and O', this is absurd. 

Applications ro the decision problems tor axiomatic theories 
(Tarski). Rosser’s motivation in 1936 was presumably simply to strengthen 
Theorems IV and V by using the simple consistency of N in place of the co- 
consistency. A fertile area of application of Corollary 2 has been cultivated 
since 1949 by Tarski and his coworkers . 213 This is to showing the undecid- 
ability of various axiomatic theories formalized in a logical calculus, which 
may be either the predicate calculus or the predicate calculus with equality. 
(Tarski uses the latter.) That is, the axioms of the theory are stated in the 
symbolism of the calculus, with predicate, (individual) and function sym- 
bols; and the calculus provides the logic. Examples of axiomatic theories 
thus formalized are the formal system N for number theory in § 38 (whose 
undecidability we already know) and the systems G, Gp, AG, AGp for 
groups and Abelian groups in § 39. As we remarked for N and G, in such 
systems the deduction theorem and our other introduction and elimination 
rules hold (Theorems 13, 21). 

By the undccidability of a formalized theory or formal system S we 
mean, as above, that there is no decision procedure for answering all of 
the questions whether a given formula A is provable in the system S 
(formalizing the theory). As we saw in the proofs of Theorems IV and VII, 
and further in § 46, when we know the decision problem for one class of 
questions ( P ) to be unsolvable, we can infer the like for another class ( Q ) 
by reducing the questions in the first class to questions in the second (or 
briefly, by reducing the first decision problem to the second). In particular 
for formal systems, if S.> is undecidabie, we can infer S t to be undecidable 
if we can find effectively, to each formula B in S 2 , a formula B' in Sj such 
that I- B in S 2 if and only if b B' in S,. 

A formal system S based on the predicate calculus (without or with 
equality) as the logic with additional or “nonlogical” axioms is said to be 
finitely axioma tizable (aft er Tarski 1949 abstract), if the number of those 

- u Theorem Vlfl itself answered a question concerning analogies between the 
hierarchies described in §46 (Kleene 1943, Mostow$ki 1947, etc.) and the hierarchies 
studied in "descriptive set theory” (Borel 1898, Lusin 1930, etc.); cf. Kleene 1950, 
Addison 1960. Other applications are e.g. in Kleene 1956, Rabin 1958a, Kleene and 
Veslev 1965 pp. 112, 183. 
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nonlogical axioms is finite, or all but a finite set of them can be omitted 
without changing the class of the provable formulas; such a finite set of 
nonlogical axioms of S (or all, if S has only finitely many nonlogical 
axioms) we may call a finite axiomatization of S. 

In the case of a syste m like N which is given with infinitely many non- 
logical axioms (the 8 particular axioms 14-21 and the N 0 -axioms by the 
Axiom Schema 1 3), it ma y not be obvious a priori whether the system i s 
finitely a x to mat i zabl e or~ n o t . (The author first heard this question asked 
about the N of § 38 during 1949; its solution, in the negative, was published 
by Ryll-Nardzewski in 1952.) 214 

The application of the reduction method to show a system S x undecid- 
able depends on having available a system S 2 already known to be undecid- 
able whose “theory” can be developed in S x (via some translation of 
formulas B in S 2 into formulas B' in S x ). The simpler S 2 is, the better the 
chance we can do this in a given Sj. 

Tarski 1949 abstract recognized that we will be in a particularly favor- 
able position for doing this, if we have first of all a system S which is both 
essentially undecidable and finitely axiomatizable, besides being simple. For 
then each system S x ( with all the symbols which S has ) is undecidable which 
has a simply consistent common extension S 3 with S. To prove this, consider 
the system S 2 which has as its (nonlogical) axioms those of S : and those 
in a finite axiomatization of S (and just the symbols of S x ). Then S 2 is a 
subsystem of S 3 , so S 2 is also (simply) consistent. Also S 2 is an extension 
of S; so by the essential undecidability of S, S 2 is undecidable. Now con- 
sider the axioms of S 2 which are not axioms of S x . They are finitely many, 
since they come from a finite axiomatization of S; say they are A 1( • ■ ■ > A m . 
So F B in S 2 if and only if VA X , . . . , VA,„ F B in S x , which by the deduc- 
tion theorem etc. (Corollaries Theorems 10 and 1 1) is the case if and only 
if F VA x D (. . . (VA,„ 3 B) . . .) in S x . Thus we have reduced the decision 
problem for the undecidable system S 2 to that for S x . So S x is undecidable, 
as was to be shown. 

To make things simple, we began with the case S x lacks no symbols of S. 
More generally, S x may lack predicate or function symbols of S, provided 
they can be “defined” or “interpreted” in a consistent common extension 
S 3 with S. For example, if S x has the notation of N while S has the predicate 
symbol <, the consistent common extension S 3 could have provable in it 

By Corollary Theorem 31 § 29, the theory of equality for a finite list of predicate 
and function symbols is finitely axiomatizable in the predicate calculus. 

If some finite set of formulas of a system S can be used as the nonlogical axioms 
instead of the original ones (i.e. without changing the class of the provable formulas), 
then some finite subset of the original nonlogical axioms can be (so S is finitely axio- 
matizable, under the above definition). Why? 



280 


COMPUTABILITY AND DECIDABILITY 


CH. V 


the formula a 3r /+<•/=/?. If S has the symbol !, S s could have 
provable a\—( ~ F(/r, fi) where F(«, /) expresses the representing 
predicate a\=b of the function a! (cf. end § 38). The additional details 
which this requires in the above argument that S x is undecidable are outside 
the scope of this book. 215 

Since Rosser 1936, it has been known that N is essentially undecidable 
(though the name “essentially undecidable” was first used by Tarski 1949 
abstract). However N is not finitely axiomatizable, as we have mentioned 
(Ryll-Nardzewski 1952). The systems of axiomatic set theory of von 
Neumann 1925, Bernays 1937-54 and Godel 1940 (if consistent) are 
essentially undecidable, since they include N (if the symbolism of N is 
suitably identified within them), and they have only finitely many axioms 
(unlike the system of Zermelo and Fraenkel § 35, where the axiom of 
subsets (11) gives rise to X 0 axioms when formalized in the predicate 
calculus). Tarski’s reduction method for decision problems for elementary 
axiomatic theories however calls for an essentially undecidable and 
finitely axiomatizable system which is much more elementary in its 
interpretation than axiomatic set theory. 

Such a system was first given by Mostowski and Tarski 1949 abstract, 
using basically Rosser’s 1936 result (included in our Corollary 2 Theorem 
VIII); their system deals with the arithmetic of the integers . . . , —2, — I, 
0, I, 2, . . . instead of the natural numbers 0, 1, 2 

In 1950 abstract R. ML Robinson showed that a certain subsystem of 
the N of § 38 is_£t&entially undecidab le and finitely ^axim patizable. Using 
the predicate calculus as the underlying logic (he used the predicate cal- 
culus with equality), it is the system having the following 13 non-logical 
axioms: 14-21, the four equality axioms for + and ■, and the formula 
„.=0 V r/>0. (This is a subsystem of N, since its five axioms which aren’t 
axioms of N are provable in N.) To see by Corollary 2 above that Robinson’s 
system is essentially undecidable, we need only verify that it satisfies the 
conditions in the second paragraph of Corollary 1. It is quite clear that, 
by applying methods used in the proof of Theorem VII from Theorem III 
(where a system S! with finitely many axioms sufficed for the relevant 
theory of ( Ex)T(a , a, a;)), we could find a system with only finitely many 
axioms that would do. This is the fundamental discovery with which we 
are concerned now. To make this result more neat, by showing that a 
subsystem of N, and indeed exactly Robinson’s system, will do, requires 


213 They involve the material cited in Footnote 156 § 38. See Tarski, Mostowski and 
Robinson 1953 or IM pp. 437-439. In the case S, lacks function symbols of S, the logic 
should be the predicate calculus with equality, or S, should have the symbol = and the 
equality axioms for the symbols of S, should be provable in S 3 . 
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additional detailed work of sorts which we have been omitting in this 
book. 216 

Using Tarski’s method with a simple essentially undecidab le finitely. 
a jdomatizable system , Tarski and his coworkers have shown the undecid- 
ability of a variety of formalized theories in the arithmetic of integers and 
reals, rings, groups, fields, lattices and projective geometries. In particular, 
by these means Tarski showed that Gp and hence G are undecidable. 217 
However Gp and G are not essentially undecidable; for, their respective 
extensions AGp and AG are decidable, as Szmielew 1948, 1955 showed. 

Exercise 47.1*. 218 For each of the following, say whether a Turing 
machine 931 can be found which performs the described operation. If “yes”, 
give the idea for its construction (as is done for various machines in §§ 42- 
47), not the full details (as in § 41). If “no”, show why not. (We say a 
machine 9JI, enumerates a set C of natural numbers, if 931, computes a 
total function q> t such that ^,(0), ^,(1), (p,(2), ... is an enumeration of C 
allowing repetitions.) 

(a) Applied to i and n, when 931, enumerates an infinite set C, SOI computes 
the /;th number y(n) in an enumeration tp( 0), 9?(1), <p{ 2), ... of C without 
repetitions. 

(b) Applied to / and n, when 931, enumerates a nonempty set C, SOI com- 
putes the /ith number <p(n) in an enumeration <p( 0), tp(l), rp{ 2), ... of C 
which is without repetitions if C is infinite. 

(c) Applied to / and n, when d(Ex)T(f, a, x) is nonempty, 93i computes 
the nth number <p(n) in an enumeration 9 ?( 0 ), <p(l), 9>(2), ... of 
a{Ex)T(f, a, x) allowing repetitions. 

(d) Similarly, with d(x)T(f, a, x) in place of a(Ex)T(f, a, x). 

(e) Applied to i and j, when 931, and 931, enumerate sets Z> # and D l for 
Theorem VI I T (1), (2°), (2 1 ), 931 computes a number/for (3°), (3 1 ). 

First, we may establish that T(i, a, x) and {/(/, a, x) are primitive recursive, and 
hence (by IM Corollary 27 p. 244 with Lemma 18b) numeralwise expressed in 
Robinson’s system by formulas T(i, a, x) and U(i, a, x). l, ‘ Then we take for C“ and C* 
the formulas 3x[T(«, a, x) & Vy(y<x 73 ~ 3T(«, a, y)) & U(«, a, x)] and 
3x[T(a, a, x) & Vy(y<x D ~lT(a, a, y)> & “tUfa, a, x)]. (Here, under the usual 
interpretation, Vy(y<x D ~ iT(a, a, y)) is redundant, but its presence facilitates 
proving (b°) and (b l ) in the weak system of Robinson.) Now (a 0 ) and (a 1 ) follow at 
once. Also it is not hard to establish (b°) and (b*), using the remark in 1M p. 198 after 
the proof of *169. If we do not want to accept the simple consistency of Robinson’s 
system on the basis of its interpretation or the nonelementary consistency proof of 
Gentzen for the N of § 38 (end § 44), an elementary consistency proof is available in 
IM Theorem 53 (a) p. 470. 

* 17 Sec Tarski, Mostowski and Robinson 1953 Chapter 3. 

The undecidability of Gp also follows from the more recently proved result of 
Novikov 1955 that the “word problem for groups” is unsolvable (end §45). 

218 Some of the solutions are in IM pp. 306, 307, 346. 
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(f) Applied to /, when SIR, enumerates a nonempty set C of members of 
d(.v)T\a, a, .».•), '.))t computes another member (not in C) of d(x)T(a, a , x). 

(g) Applied to a, when a is the Godel number of a formula A of N § 38 
(in a fixed effective Godel numbering of those formulas), S )JI decides 
whether A b N I =0. 

(h) Applied to a, when a is the Godel number of a formula A of N (as in 
(g)), ').)£ decides whether it is decidable whether A h N B where B ranges 
over till formulas of N. 



CHAPTER VI 


THE PREDICATE CALCULUS 
(ADDITIONAL TOPICS) 


§48. Godel’s completeness theorem: introduction. We shall 
continue our series of theorems about the propositional and predicate 
calculi, begun in Chapters I— III.* 1 ® 

In the propositional calculus, theoretically every question of provability 
or deducibility is answerable by using truth tables. Of course, practical 
difficulties arise, if we ask the questions about too many or too complicated 
formulas. In the predicate calculus, we cannot actually complete the 
construction of the truth tables when variables are present, except in 
finite domains. 

This difference between the propositional and predicate calculi has 
now been underlined by Theorem VII in §45. By its proof with (A) in 
§46, there is a formula of the pure predicate calculus Pd (§39) whose 
unprovability is equivalent to the truth of Fermat’s “last theorem” 
(§ 40). In effect, mathematicians have labored unsuccessfully for over 300 
years to settle the question whether this one particular formula of the predi- 
cate calculus is unprovable or provable. Of course the predicate calculus 
as we know it hadn’t been formulated 300 years ago. 220 But this example 
illustrates the futility of approaching all questions of provability and 
unprovability in the predicate calculus by using just the ideas of Chapter 
II. The predicate calculus is such a rich system that a host of particular 
problems that are commonly considered in mathematics, using the predi- 
cate calculus as a tool in successive short arguments, can be clothed 
entirely in the pure predicate calculus. Indeed, as the proof of Theorem 
VII illustrates, this is the case for all problems whether a given statement 
holds in a formal axiomatic theory whose axioms are finite in number and 
expressible in the symbolism of the predicate calculus with predicate, 
(individual) and function symbols. 

219 Many of the results of this chapter belong to classical nonfinitary model theory, 
and thus not to inetamathematics (§ 37). 

290 Although characteristic features of the predicate calculus go back to Frege 1 879, 
the first explicit formulation of it as a separate formal system is perhaps in Hilbert and 
Ackermann 1928, according to Church 1956 pp. 288 fT. 
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Nevertheless, there is more that can be learned- by the study of the 
predicate calculus, pure or applied (§ 39), as a logical system. 

We mentioned in §23 that Godel’s completeness theorem 1930 will 
extend Theorem 14 § 12 to the predicate calculus: For each formula F of 
the predicate calculus (§ 16), if F is valid (§ 17), then F is provable (§ 21), 
or briefly if l= F then f F. The theorem will include some further infor- 
mation, and have some other versions. For the present, we address 
ourselves to the problem of how to prove the italicized statement just 
given. 

We now adopt the term “parameters” (after Beth 1953 and Craig 
1957a) as a name for the symbols or syntactical entities in a formula or 
formulas to which values are assigned in entering truth tables (and 
similarly in terms).- 21 A proposition[al) parameter, is an atom in the 
propositional calculus (§ 1), and in the predicate calculus a 0-place ion 
(§ 16). A predicate parameter is an //-place ion for n > 0. An individual 
parameter is a free variable or 0-place meson (§ 28). A function parameter 
is an //-place meson for n > 0. 

A parameter of a formula or list of formulas is one which actually 
occurs in the formula or in some of the formulas. But sometimes we enter 
tables with values of other parameters; cf. beginning §4. 

In this chapter, for definiteness we will usually speak of formulas as 
constructed using individual, function, proposition(al) and predicate 
symbols, as in an applied predicate calculus §39. But the treatment will 
also hold good using instead any 0-place mesons, //-place mesons, 0-place 
ions and //-place ions allowable under formation rules as in §§ 16, 28. 

Returning to our problem, a formula F in the predicate calculus is not 
valid, exactly if F is falsifiable in the following sense: there is some 
(nonempty) domain D and some assignment in D to the parameters of F 
for which F takes the value f. In this case, we call such an assignment a 
falsifying assignment for F in D, and we say F is falsifiable in that D or 
is D-falsfiable. The D and falsifying assignment together may be called a 
counterexample to F. 222 (Replacing f by t, we get the notions satisfiable, 
satisfying assignment, B-satisfiable and example.) 

We now ask whether it is not possible to search for counterexamples to 
formulas F in such a systematic manner that the following will be the case, 

221 Thus in $ 2 (1) the parameters are P, Q, R; in § 17 Example 1 they are P(x), Q, y or 
I\- ), Q, y or simply P. Q, y ; in § 28 Example I , f, x for the term P anti f for the formula; 
and in y 29 Example 2. P, f (but not =). (However in this chapter until § 52, we shall 
not use i} 29, which gives a fixed preassigned value to =.) 

222 If E is a formula of the propositional calculus simply, wc don’t need to mention 
any domain D, and a falsifying assignment or counterexample can be simply an assign- 
ment of t’s and f's to the atoms of F which makes F f. 
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for any given formula F of the predicate calculus: (I) If any counterexample 
to F exists (i.e. if not 1= F), then the search will lead to one. (II) If no 
counterexample to F exists (i.e. if h F), then as we pursue the search that 
fact will eventually manifest itself by the closing to us of all the avenues 
along which we are searching, whereupon we will be in a position to prove 
F (i.e. then I- F). 

This idea was used independently by Beth 1955,~Hintikka 1955, 1955a, 
Schiitte 1956 and Kanger 1957 to give proofs of Godel’s 1 930 completeness 
theorem in which the connection between model theory and proof theory 
comes in very naturally. The treatment below is quite close to Beth 1955, 
which gave the present writer the idea for it. 223 

So we consider how we can search systematically for counterexamples 
to formulas F of the predicate calculus. 

Example 1. Let F be 3x(P 3 Q(x)) 3 (P 3 VxQ(x)). We seek a 
(non-empty) domain D and an assignment in D to P, Q(x) which makes 
(1) 3x(P 3 Q(x)) D(P3 VxQ(x)) f. By the truth table for 3 in § 2, a D 
and assignment which do this must make (2) 3x(P 3 Q(x)) t and (3) 
P 3 VxQ(x) f; and doing both the latter is also sufficient for doing the 
former. For the same reason, to make P 3 VxQ(x) f, it is both necessary 
and sufficient to make (4) P t and (5) VxQ(x) f. 

By the evaluation rule for 3 in § 17, to make 3x(P 3 Q(x)) t (cf. (2)) it 
is necessary and sufficient to pick the domain D so that it contains an 
element, which we may call a 0 , such that (6) P 3 Q(a 0 ) is t. 

To make P 3 Q(a 0 ) t, we have two alternatives. It is necessary and 
sufficient either (i) to make (7) P f or (ii) to make (8) Q(a 0 ) t. We don’t 
aeed to do both (i.e. it isn’t necessary, though of course it would be 
sufficient). So here our search for a counterexample to F splits, and we 
:an follow either of two paths or avenues. 

Consider the first path (i), along which we seek to make P f. But at (4) 
ve already had to make the same formula P t. These two requirements 
ire incompatible. Thus, along this path we cannot have a counterexample. 
This avenue is closed to us; it constitutes a “blind alley” or “dead end”. 

So, if we can get a counterexample at all, it has to be by following the 
econd path (ii). Continuing along this path, to make VxQ(x) f (cf. (5)), it 
s necessary and sufficient that the domain D contain an element a, such 
hat (9) Q(a x ) is f. We have no right to assume this element is the same as 

223 In some respects it more resembles Kanger 1957, as the author learned after 
/orking it out. 

Some of the other proofs of Godel’s completeness theorem are in Hilbert and Acker- 
lann 1938 (= 2nd ed. of 1928), Hilbert and Bernays 1939, Mostowski 1948b, Henkin 
949, 1963, Rasiowa and Sikorski 1950, Rieger 1951, A. Robinson 1951, 1963, Beth 
951, Kicene IM (1952b), 1958. Those listed in bold face apply topology or algebra. 
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the element a„ introduced at (6), so we are using another letter a! for it. 
But now, along this path (ii), we indeed have been led to a counterexample. 
For, our successive analytical steps have shown that it is sufficient for our 
purpose to pick a domain D containing at least elements named a 0 and a^ 
and an assignment to the parameters a 0 , a„ P, Q(x) such that (4) P and 
(8) Q(a 0 ) are both t while (9) Q(a,) is f. This we can do as follows. We pick 
D to be a domain of exactly two elements; say D = {0, I }. To a 0 and a, we 
assign the respective values 0 and 1. To P we give the value t. We evaluate 
Q(x) by the logical function l(x) such that l(0)-is t (so Q(a 0 ) is t) and 
1(1) is f (so Q(a,) is f). (This is the logical function ( 2 (x) of § 17 Example 1, 
allowing for the difference in notation, the elements being named “1” 
and “2” there, “0” and “1” here.) Thus F is falsifiable; so not 1= F. 

This analysis has taken considerable space to give verbally. We now 
adopt a symbolic representation of such analyses. We choose our method 
of representation with the aim of putting before our minds an absolutely 
clear picture of the structure of our searches for counterexamples, 
including both the situations obtaining initially and after successive 
steps, and the over-all structure. These symbolic representations can be 
somewhat cumbersome. But our purpose is to reason about them, rather 
than to use them extensively in practice; so it does not much matter if 
they are cumbersome. 

As a search for a counterexample proceeds, initially and after each 
step, along whichever path we are pursuing (if we have had a choice or 
successive choices), we have two (finite) lists of formulas: a list A of 
(zero or more) formulas which we are aiming to make t, and a list A of 
(zero or more) formulas which we are aiming to make f. The steps of 
analysis up to the one just completed (inclusive) have shown that making 
simultaneously all of A t and all of A f will suffice to make the original 
formula F f. (Initially, A is empty and A is simply F.) 

Also at any stage in the search as a whole, our analysis has shown that, 
to make F f, it is necessary that we make all of A t and all of A f in the 
lists A and A last reached, along at least one of the paths (if we have had 
choices). 

So the situation, initially or after any step, is represented by an ordered 
pair (A; A). For reasons partly historical, we elect to write “A — » A” 
instead of “(A; A}”. Here — » is a new formal symbol (which may be read 
“give(s)”). The formal expression A — > A (for any two finite sequences 
A and A of zero or more formulas each) we call a sequent ; and we call A 
its antecedent and A its succedent. 

It remains to represent the structure of our searches as a whole. This 
we do by arranging the sequents in the order in which we are led to them. 
For reasons partly historical, we place the initial sequent — * F lowest in 
the figure. Each time we perform a step, we draw a line above and write 
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the one or (in case we have a choice) two sequents to which the step leads. 

Thus we represent the analysis or search in Example 1 by the following 
“tree” (left). 


v/ 


X Q(a«), P -» Q(a t ) 

P -* VxQ(x), P Q(a„), P -* VxQ(x) 

P,PPQ(a 0 )^VxQ(x) ^ ^ 

P, 3x(P 3 Q(x)) VxQ(x) 

3x(P =3 Q(x)) -> P 3 VxQ(x) 

— »3x(P 13 Q(x)) 3 (P => VxQ(x)). 



K 00010 


We have placed a cross “ x ” at the top of one path or branch to show that 
this path is (terminated and) closed to us in our search for a counter- 
example, a check ‘V” at the top of the other to indicate that the search 
has terminated successfully there. The symbol *3” indicates that the 
step in question is an analysis of an implication in the succedent (which we 
are aiming to make f), “3 — that it is an analysis of an existence formula 
in the antecedent (which we are aiming to make t), etc. 

The tree of sequents (left) can be considered as the result of placing 
sequents on the vertices of a purely geometric tree (shown without the 
sequents at the right). The geometric tree in this example consists of 7 
vertices in a certain arrangement (a “partial ordering”) shown by the 
arrows. Thus, the sequent Q(a 0 ), P -* VxQ(x) is placed at the vertex 
V mv A path or “avenue” in our search for a counterexample is repre- 
sented by a succession of vertices, starting at the bottom and following 
arrows in the tree. There are two paths in this example, Pfo^oo^ono^oooo 
and KK u K ao K 000 K 0001 K 0(W10 ; they run together as far as the vertex K 000 , 
after which they diverge. 224 

«« An alternative representation of our search is obtained by simply omitting the 
verbal explanations given above, but writing the nine numbered formulas in two columns, 
marked true (t) and false (f). After (6), where we have a choice, the columns' split into 
respective left and right subcoiumns (i) and (ii). This gives what Beth 1955, 1959 calls a 
semantic tableau: 


t 

f 

(2) 3x(P 3 Q(x)) (1) 

(4) P (3) 

(6)P3Q(a 0 ) (2) 

(i) (it) 

(8) Q(a„) (6) 

(1) 3x(P 3 Q(x)) 3 (P 3 VxQ(x)) 

(3) P 3 VxQ(x) (1) 

(5) VxQ(x) (3) 

(i) (>') 

(7) P (6) (9) Q(a,) (5) 


If we intended to use the search method extensively in practice, the tableaux would be 
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Exampli: 2. Let F be 3x(P 15 Q(x))3 (P 3 3xQ(x)). The analysis is 
the same as in Example 1, except with 3xQ(x) in place of VxQ(x), down 
through (8). Now to make (5) 3xQ(x) f, it is necessary that (9) Q(a 0 ) be 
f. This is sufficient if the domain D contains only a 0) but not otherwise. 
So in representing the new situation we don’t omit 3xQ(x) from the list 
of the formulas we are aiming to make f. (We haven’t finally picked D 
yet; we have thus far only committed ourselves to its having at least the 
element a 0 introduced at (6).) However, if we look at the situation we now 
have on the second path (ii), we see that the requirement that (9) Q(a„) 
be f conflicts with the requirement that (8) Q(a 0 ) be t. So in this example 
both paths or avenues open to us in searching systematically for a counter- 
example to F are closed, or the tree itself is closed. This completes an 
informal proof (in a classical observer’s language) that there is no 
counterexample to this F, i.e. a proof that F is valid. Now it would be 
surprising if we could not utilize the method of this informal proof that 
M F to construct a formal proof of the formula F and thus to show that 
f- F. If our formal system of predicate calculus in Chapter II were not 
adequate to do this, we would certainly look for ways to augment it. 
However, we postpone this part of the problem to § 51. The sequent tree 
in this Example 2 is as follows. (Several formulas are in boldface for later 
reference.) 


X 

x Q(a 0 ), P -> 3xQ(x), Q(a 0 ) ) 3 

P -> 3xQ(x). P Qfaa), P -> 3xQ(x) 

P. P 13 Q(a 0 ) — > 3xQ(x) ^ 
P,3x( P 3Q(x))~»3xQ(x) ^ 
3x(P3Q(x))->P3 3xQ(x) 

-»3x(P 3 Q(x)) 3 (P 3 3xQ(x)). 



Before continuing, we observe that we can codify the steps of analysis 
we are using in our searches. In terms of the representation by trees of 


more efficient than the sequent trees, since they do not require us to recopy formulas 
whose status is not being changed. But they do not show the individual situations as 
simply (c.g. that w hen (8) has just been introduced, A is (4), (8) and A is (5)). Hence we 
prefer to use the sequent trees in the proof of Godel’s completeness theorem and other 
theoretical uses.'-'' 1 ’ (The tableau here does show explicitly that, when (8) has just been 
introduced, it suffices to make (2), (4), (6), (8) t and ( I ), (3), (5) f. This kind of information 
taken with the structure or such lists, is enough for recognizing the counterexamples. 
But, altogether, we believe what we shall be doing is more easily visualized in terms of 
the sequent trees.) 
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sequcnts, each step is performable by one of the following 14 rules. 
Thus, we have several times used the principle that, in order to make an 
implication A 3 B f, it is necessary and sufficient to make A t and B f. 
This is now codified by the rule at the upper left, called O” or “the 
3-succedent rule”. The T and 0 tag along to indicate lists of zero or 
more formulas not changed in the step, those in T to be made t and those 
in 0 to be made f. 

a, r— >©,B r — > 0, a b, r — » © 


a, r — » 0 , b 

r —» 0, A 3 B. 

r->0,A r-*©,B 
r — > 0, A & B. 

J^AL_*v 

r-»0,AVB. 

a, r->© 

— i ~»-i 

r-»e, -ia. 

a, r->0, b b, r -» 0, A 
r -> 0 , a ~ b. 
r-»0, A(b> 

P->0, VxA(x) 
where b does not occur 
free in P — > 0, VxA(x). 

P~ * 0, 3xA(x), A(r) _ 3 
P — * 0, 3xA(x). 


A 3 B, T -» 0. 

A, B, P — » 0 
A & B, P -» 0. 

a, r-»e B,r-»0 

A V B, P— > 0. 

P — > 0, A 
-iA, P->0. " 1_> 

A, B, P — > 0 P — > 0, A, B 
A ~ B, T — * 0. 

A(r), VxA(x), P — > 0 
VxA(x), P-»0. 


A(b), r-40 j ^ 

3xA(x), P — * 0 

where b does not occur free 
in 3xA(x), r-> 0. 


In these rules, A and B are (allowed to be) any formulas; x is any variable; 
A(x) is any formula; b is any variable free for x in A(x) (and unless b is 
x not occurring free in A(x)); r is any variable not necessarily distinct from 
the other variables present, or (if formation rules as in § 28 are used) any 
term, free for x in A(x); A(b) and A(r) are the results of substituting b 
and r respectively for the free occurrences of x in A(x); and T and 0 are 
any (finite) lists of (zero or more) formulas. 

In using the rules — »V and 3 — >, we must obey the restriction on 
variables (stated with the rules); briefly, b shall not occur free in the 
sequent below the line. (When the A(x) does not contain the x free, then 
A(b) is A(x) no matter what variable b is. Wc agree in such a case to 
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choose for the analysis a variable b not occurring free in the lower sequent, 
so the restriction will be met,) 

The order of listing the formulas within any antecedent and within any 
succedent is to be immaterial in applying the rules. Thus in Examples 1 
and 2, the rule 3 — * (bottom right) applies from V 00 to V 000 (with a 0 as the 
b), even though 3x(P 3 Q(x)) is not written first in the antecedent. 

Let us extend our evaluation process from formulas to sequents, thus: 
A— »A shall take the value f when all of A are t and all of A are f; 
otherwise, the value t. We say a sequent A — » A is falsifiable , if for some 
(non-empty) domain D and some assignment in D to (at least) all its 
parameters, it takes the value f. We say A — * A is valid, or in symbols 
f=A— >A, in contrary case that, for every (non-empty) domain and 
assignment, A — > A is t. 

Each of the 14 rules has been picked (by reasoning illustrated in 
Examples 1 and 2, (a) for necessity and (b) for sufficiency) to have the 
property stated in: 

Lemma 6. For each of the 14 rules — >3, . . . ,3 — * listed above, with 
the stated stipulations: The sequent written below the line is falsifiable , (b) 
if and (a) only if the sequent, or at least one of the two sequents, written 
above the line is falsifiable. Equivalently: The sequent written below the 
line is valid, (a) if and (b) only if the sequent, or each of the two sequents, 
written above the line is valid. 

It is of course quicker to use the rules than to think through the 
principles embodied in them at each step. 

Proceeding upward, we can close a path (signifying that we have lost 
hope of finding a counterexample along it) when we have reached a 
sequent which we recognize cannot be made f. We codify this by saying 
we shall close a path as soon as we reach a sequent of the form 

(x) C, T->0, C. 

Here we could allow C to be any formula. However it will be useful 
later (in §§ 55, 56) to know that mistaken attempts at counterexamples can 
always be rejected using an atom (prime formula) as the C. So we stipulate 
here that C be such. As with the rules, F and 0 are any lists of formulas; 
and the order of formulas within the antecedent and within the succedent 
is immaterial. 

Lemma 7. No sequent of the form (x) is falsifiable. Equivalently: 
Each sequent of the form ( X ) is valid. 

Example 1 illustrates the situation envisaged in (I) of our proposed 
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lan of attack on the completeness problem (tenth paragraph of this 
;ction). Example 2 illustrates (11), provided we can convert the closed 
;quent tree into a formal proof of F. We give three more examples. 
Example 3. 


P(a„, a t ), P(a t , a 2 ), P(a 2 , a 3 ), Vx3yP(x, y) -> ^ 

P(a 0 , a,), P(a ; , a 2 ), 3yP(a 2 , y), Vx3yP(x, y) -» ^ 

P(a 0 , a t ), P(a t , a 2 ), Vx3yP(x, y) -» ^ 

P(a 0 , a,), 3yP(a t , y), Vx3yP(x, y) -> ^ 

P(a Q , a t ), Vx3yP(x, y) -» g ? 

3yP(a 0 , y), Vx3yP(x, y) -» y ^ 

Vx3yP(x, y) — » _ 

— * ~iVx3yP(x, y). 

Here there is a single path (no branching). This path can be pursued 
upward ad infinitum: along it we never reach either a sequent of the form 
(x) at which we can close it (as at K 0000 in Example 1, and at both 
and Fqooio in Example 2), or a sequent from which no further step upward 
can be made by our rules (as at K 00010 in Example 1). Does this tree lead us 
to a counterexample? 

Indeed it does, under either of two approaches. At we know that 
to falsify -iVx3yP(x, y) it is (necessary and) sufficient to make P(a 0 , a,) 
and Vx3yP(x, y) both t. There are two possibilities: a 0 and a t are the 
same element of the domain D, or different elements. Under the first 
approach, wc try making D = {0} with 0 as the value of both a„ and a^ 
We evaluate P by the logical function l for which 1(0, 0) is t, so P(a„, a,) 
is t. Evidently this also makes Vx3yP(x, y) t; or we can argue that for 
a D with only the element a 0 we could have omitted Vx3yP(x, y) at V 00 
and hence at K 000 . Thus we have a counterexample, indicated by the tree 
just up to V 0l)0 . 

Now let us instead try having a 0 and Uj different elements of D, and 
(following this approach consistently) likewise having each of a 2 , a 3 , 
a 4 , . . . in turn a new element. Let D = {0, 1,2,.. .}, and assign to a„, 

aj, a 2 , . . . the values 0, 1, 2 To P let us assign the logical function l 

such that l(x, y) is t when x and y are consecutive natural numbers, and 
always f (or always t) otherwise. This makes all the atoms in the antecedents 
t. The student should have no trouble in seeing that consequently it makes 
also the molecules there t, and hence makes ~iVx3yP(x, y) in the succedent 
at the bottom f. So we have another counterexample, corresponding to the 
whole infinite path. 
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Let us consider the two approaches in general. 2 ; 5 When we try for a 
counterexample, at each step upward by — >V or 3— » we have a choice 
between the two for the variable b introduced in that step if any variables 
were previously introduced (as at (9) or F 00010 in Example 1, and at K 000 
in Example 3, with a t as the b). Suppose there is a counterexample using 
the first approach, i.e. with b standing for the same element of D as some 
other variable a already introduced (as a 0 in Example 3). Then there is 
also a counterexample using the second approach. We can get this from 
the first counterexample by splitting the element represented by both a and 
b into two elements (enlarging D by one element) and treating these two 
elements (one to be used as value of a and the other of b) like the one 
element they replace in constructing the logical functions (and functions 
with values in D if mesons are present) for the falsifying assignment. 226 
(As we have seen, in Example 3 at K 000 both approaches work; in Example 
1 at V oam only the second works.) So we can never miss finding a counter- 
example (if a counterexample exists at all) by confining ourselves to the 
second approach. The treatment in §§49, 50 is set up on this basis. In 
it, we will provide for terminating a path in a sequent tree (even though 
the rules could be applied further) when a stage is reached from which a 
counterexample can be read off under the second approach. For our 
primary purpose of proving Godel’s completeness theorem, the additional 
possibilities for finding counterexamples afforded by the first approach 
are an unnecessary distraction. Of course, the first approach will often 
lead to a counterexample more quickly or to a simpler counterexample 
(as in Example 3). 

In Example 3, using the second approach, we' saw how an infinite 
counterexample (i.e. one with an infinite D) can be read off a suitably 
constructed infinite path. In Example 3, there was also a finite counter- 
example, using the first approach. Our last two examples will illustrate two 
points: (a) there may be only infinite counterexamples (Example 4); (b) 
without some over-all plan governing our searches, they may not lead to 
a counterexample or a closed tree, even though there be one or the other 
(Examples 4 and 5). 

255 The reasoning in this paragraph is to make our procedure in §49 understandable 
in advance. The fact that by it vve do succeed in proving Godel’s completeness theorem 
would be sufficient justification for it. 

228 Wc showed such a splitting in § 30 Example 5, where it was a question of finding 
an example for a formula E rather than a counterexample. Until § 52, our evaluation 
rules arc those of the predicate calculus § 17 or the predicate calculus with functions 
§ 28, not those of the predicate calculus with equality §29. Meanwhile, if = should 
occur in any of our formulas it is to be treated like any other predicate parameter. 
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Example 4. Let G be 

Vx~iP(x, x) & VxVyVz(P(x, y) & P(y, z) D P(x, z)). 


G, 3yP(a„, y), Vx3yP(x, y) -» $ K 000 

G, Vx3yP(x, y) —* . ? \v w 

G & Vx3yP(x, y) — > OK 0 

-> -i(G & Vx3yP(x, y)). ^ 6 V 


The tree shown is to be continued upward ad infinitum from K 00 with the 
same sequents as in Example 3 from V 0 except that G is at the front of 
each antecedent. 

The one path in this tree does not indicate to us a counterexample. For, 
in building it upward as proposed, we spend forever analyzing the condi- 
tions for the truth of Vx3yP(x, y) (as in Example 3) and never get to 
those for G. 

Now we show that there is a counterexample to ”1(0 & Vx3yP(x, y)) 
with D = (0, 1,2,...} but no finite counterexample. 

To do this it will suffice to show that G & Vx3yP(x, y) is t for a suitable 
assignment in D = {0, 1, 2, . . but is always f in any finite (non- 
empty) domain. 

It is easy to see what is going on, if in G & Vx3yP(x, y) we not only 
unabbreviate G, but take the outer conjunctions apart, omit the universal 
quantifiers, and change P( — , — ) to — < — : 

— ix<x, x<y & y <z D x<z, 3yx<y. 

These order axioms are all true (with the free variables in the generality 
interpretation, §§ 20, 38) when D = (0, 1,2,...} and < stands for the 
usual order relation between natural numbers. Thus G & Vx3yP(x, y) is 
t (and ~i(G & Vx3yP(x, y)) is f), when D = (0, 1,2,...} and to P is 
assigned as value the logical function ( such that Ific, y) is t when x<,y 
and f otherwise. 

But these order axioms cannot be satisfied in any finite (non-empty) 
domain, as we now show. Consider e.g. a domain D with just three 
elements. Say one of the elements is a 0 . By 3y x<y, there is some y such 
that a 0 <y; by - ix<x this y cannot be a 0 ; say it is a t (different from a 0 ); 
thus a 0 <a l . Again using 3y x<y, there is a y such that a x <y; by “ix<x 
this cannot be a^ and also it cannot be a 0 , since then a 0 <a 1 and a^a,, 
with x<y & y<z x<z would give a o <a 0 , contradicting ~ix<x; so this 
y is the remaining element a 2 ; thus a^a,. Now by 3yx<y, there must 
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be a y such that a 2 <y ; but in the manner already illustrated, this y cannot 
be a 0 or a x or a 2 . So it is impossible to have all three formulas written with 
< true simultaneously, if D has just three elements. If we construct the 
truth table for G & Vx3yP(x, y) with D = {0, 1,2} in the manner of 
§ 17 (it will have 2” = 512 lines), we shall get a solid column of f’s. The 
like will be the case for every finite (non-empty) D (i.e. for B = 1, 2, 
3, . . .). 

The fact that there are formulas which are valid (not falsifiable) in 
every non-empty finite domain, but are not valid (are falsifiable) in 
D = { 0,1,2,...}, was first noticed by Lowenheim 1915; the present 
example - i(G & Vx3yP(x, y)) is from Hilbert and Bernays 1934 pp. 
123-124. 

We shall not take the space now to illustrate how our search procedure, 
when suitably managed (differently than above), leads to a counterexample 
in Example 4 (Exercise 49.3). 

Exampli; 5. Take the tree as in Example 4, but with Q & — iQ as the 
G. There is no counterexample (since at K 00 we can’t make G t). But 
as we are misconducting the search, this doesn’t manifest itself by the 
path becoming closed. 

from Example 4 (and Example 3 if we confine ourselves to the second 
approach) we see that we must interpret (I) broadly enough to include 
counterexamples that are only developed by pursuing the search along 
some path ad infinitum. But our aim is to show for (II) that, in a properly 
managed search, if no counterexample exists (F is valid), that fact will 
always manifest itself by the closing of all paths after a finite amount of 
searching (as in Example 2). This is why, when F is valid, we shall always 
be able to find a proof of F (§ 51). 

We could have anticipated from Theorem VII of Church that, if under 
(II) we are always to have a finite closed tree, then under (I) we cannot 
always effectively learn the existence of a counterexample in finitely many 
steps, whatever procedure we adopt. For, if we could, then we would 
have an algorithm or decision procedure (§ 40) for answering all questions 
whether a formula F in the predicate calculus is valid. By Godel’s 
completeness theorem (which we are aiming to prove) with Theorem 12 1>d 
(§ 23), this would be an algorithm for provability in the predicate calculus, 
contradicting Theorem VII. 227 

This situation is the reverse of what we might naively have expected 
back in § 17, where we found some finite counterexamples, but required 
general reasoning to establish validity. 


r - 7 Also, we can easily adapt the proof of Theorem VII to show directly (without 
using Godel's completeness theorem) that there is no algorithm for validity. 
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In Example 3 (with the second approach) and Example 4, although the 
counterexamples are infinite, the logical functions or predicates can be 
described effectively; 164 i.e. there are algorithms for them. This will not 
always happen under (I). 228 

Exercises. 48.1. By a systematic search (presented as a sequent tree), 
either find (and describe) a counterexample, or show that there is none, 
(a) PVQDP&Q. (b) (P 3 -iP) 3 -iP. 

(c) P V VxQ(x) 3 Vx(P V Q(x)). (d) 3xP(x) & 3xQ(x) 3 3x(P(x) & Q(x)). 

48.2. Show that (under the definition preceding Lemma 6): 

(a) P A l5 . . . , A w — > B l5 . . . , B„, if and only if, for each (non-empty) 
domain D and each assignment in D to the parameters of 
A lf . . . , A,„ — > Bj, . , . , B n , either m > 0 and one of A 1? . . . , A m is f, 
or n > 0 and one of B 1( . . . , B„ is t (briefly: either some of A 1; . . . , A m 
are f, or some of B x , . . . , B„are t; cf. § 26). (b) Hence: P A„ . . . , A„, — * B 
if and only if A 1( . . . , A w (= B (§ 20). 

48.3*. Using insight (as in Example 4) rather than the systematic 
search procedure, find a counterexample in D = {0, 1, 2, . . .} to 

-i{Vx-iP(x, x) & VxVyVz(P(x, y) & P(y, z) 3 P(x, z)) 

& Vx3yP(x, y) & Vy3xP(x, y)}. 

Show that there is no finite counterexample. 

§49. Godel’s completeness theorem: the basic discovery. 
Examples 4 and 5 have shown that we must adopt some over-all plan to 
guide our systematic searches for counterexamples to formulas F, if we 
are always to obtain either a counterexample or a closed tree. 

The forms of the individual analytical steps upward have been described 
adequately by the list of 14 rules. Prior to adopting a search plan, we shall 
note some features of the steps (or the rules for them). 

In each step upward by one of the rules, we recopy the list of formulas 
we are aiming to make t and the list we are aiming to make f, with a 
revision or two alternative revisions. The revision comes from analyzing 
the conditions for the truth t or the falsity f of one of the formulas (the 
principal formula in the step) with respect to its outermost propositional 
connective or quantifier (the principal operator). On the basis of this 
analysis, we introduce into our lists one or two other formulas (the side 
formula(s)). Thereupon the principal formula becomes redundant and is 
omitted, except in the two rules and — >3. The remaining formulas 

2is The logical functions or predicates can always be of degree ^ 1 § 46, by IM 
Theorems 38 and 40 pp. 398, 401 with Theorem XI (Post’s theorem) IM p. 293. Further 
information about these predicates is given in Mostowski 1954 pp. 284-285. 
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(extra formulas) are rccopicd unchanged. For example: 


side formula 

0, 3xA(x), A(tp_ 

— 


' T — *J r), 3 xA(x) 

- extra formulas 
principal operator 
principal formula 



r- 


sidejbrmulas-, 

0 , BJ' 0 


rA D B, l 1 -» 0 

v v 

f extra formulas 
'principal operator 
principal formula 


Hence in an obvious way, when we make a step upward by one of the 
rules, each formula occurrence (as one of the formulas listed as the 
antecedent or as the succedent) in the sequent or in either sequent above 
the line "originates from” or belongs to or is an ( immediate ) ancestor of a 
particular formula occurrence in the sequent below the line (its immediate 
descendant). Here we assume that, in any case of doubt, an analysis has 
been adopted fixing which formula occurrence is the principal formula, 
which are the side formula or respective side formulas, and which extra 
formula occurrence above comes from each one below. 229 

We can trace these ancestral relationships upward (or descendant 
relationships downward) through a succession of steps. Also, besides the 
ancestors of a formula occurrence in sequents above its own ( proper 
ancestors), it is convenient to consider it as its own ancestor in its own 
sequent (its improper ancestor ); and likewise with descendants. To 
illustrate this, in Example 2 all 6 ancestors of the formula (occurrence) in 
the antecedent at V 0 are printed with letters in boldface. 

Like relationships apply to formula parts of formulas, to operator 
occurrences in formulas, and to occurrences of predicate parameters. 
In Example 2, the first D in the bottom sequent has four ancestors, 
namely itself and the three D’s in boldface formulas; the first Q has 6 
ancestors, itself and the 5 boldface Q’s. 

In designating these relationships for formula parts (or wholes) of 
formulas, we use image (or ancestral image and descendant image) instead 
of “ancestor” or “descendant” when we wish to restrict the relationship to 
parts (or wholes) which are the same except possibly for substitutions of 
terms for variables (reading up) or inversely (reading down). Thus in 
Example 2 reading down, the boldface Q(a„) at K 00010 has as (descendant) 
images Q(a„), Q(a 0 ), Q(a„), Q(x), Q(x), Q(x), and as descendants Q(a 0 ), 


228 There could be doubt only when the A or the A of one of the sequents A — ► A 
contains several occurrences of the same formula, or with the rules V — r and — >3 when 
the A of the A of the sequent below the line contains congruent formulas VxA(x) and 
VyA(y), or 3xA(x) and 3yA(y). 
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Q(a„), P 3 Q(a 0 ), 3x(P 3 Q(x)), 3x(P 3 Q(x)), 3x(P 3 Q(x)) 3 (P 3 3xQ(x)). 
Reading up, P 3 Q(x) at V has as (ancestral) images P 3 Q(x), P 3 Q(x), 
P 3 Q(x), P 3 Q(a 0 ). For operator or predicate parameter occurrences, 
“ancestral image” and “descendant image” are synonymous with 
“ancestor” and "descendant”, respectively. 

A given composite formula occurrence in a sequent can serve as the 
principal formula for only one of the 14 (= 2-7) rules, according as the 
occurrence is in the succedent or the antecedent (2 choices) and according 
to the kind of the operator which it has outermost (7 choices). So we can 
classify the composite formula occurrences in sequents by the rules 
applicable to them as principal formulas. 

In treating the problem of falsifying a formula F via our search pro- 
cedure, the problem quickly generalized itself to that of simultaneously 
making in formulas A L , . . . , A m t and n formulas B„ . . . , B„ f, or 
equivalently (by the definition preceding Lemma 6) of making a sequent 

Aj A„, — * B x , . . . , B„ f (m, n > 0). In other words, A,, . . . , A,„ 

are to be satisfied and B x , . . . , B„ to be falsified, simultaneously; or 
Ai, . . . , A,„ — » B x , . . . , B„ is to be falsified. 

Now we are ready to adopt a plan to guide our systematic searches 
for counterexamples. 

First (Case (A)), we undertake to falsify a sequent 

E,, .... E*— » Fj, ... , F, where E t , . . . , E A , F x F ( are formulas 

in the sense of § 16, except that they may contain individual symbols 
(but not other function symbols). 2W By taking — ► F as the sequent, this 
includes the case of falsifying a single formula F. The sequent 
E„ . . . , E* — * Fj, . . . , F, may contain free variables, but those variables 
shall not also occur bound in it. 231 

Let u 0 , .... u,, be a list (possibly empty) of the free variables and 

individual symbols occurring in E„ . . . , E* — > F x F,. Let a 0 , 

a 2 , . . . be variables not occurring in E x , — , E t — > F x , ... , F,. The 
terms u„, . . . , u„, a 0 , a x , a 2 , . . . , as far as we activate them (i.e. make them 
ivailable), will be used as the b’s and r’s for our applications of the 
predicate rules — »V, V— », — >3, 3— >. Because the variables among 

j 0 u„ do not occur bound in E x , . . . , E fc — > F x , . . . , Fj, and a 0 , a x , 

t 2 , . . . are “new” variables not occurring in E x , . . . , E k — ► F x , . . . , F„ 

he substitutions with results A(b) and A(r) performed in using these rules 

2:10 We use E’s and F’s to allow “A,, . . . , A,„ — » B B„” to denote various 

cquonts in a tree having the fixed sequent E,, . . . , E* — * F,, . . . , Fj at the 

>ottom. 

2,1 The reader who is willing to agree that a search plan can be formulated (in this 
fase (A)) which makes Lemma 8 true may skip the details by passing to Lemma 
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will be free. 21 * 2 Those of u 0 , . . . , u„, a 0 , a 1( a 2 , . . . which we activate are 
intended to name distinct members of the domain D for the counter- 
example to E„ . . . , E t . — * Fj, . . . , F ; we are attempting to construct. 233 
As the search for a counterexample to E lf . . . , E fc — * F t , . . . , F, is 
pursued along any particular path in our sequent tree, we keep track 
step by step of which of u 0 , . . . , u ; „ a 0 , a 1( a g , . . . have thus far been 
actuated. To represent this concretely, we may employ a barrier |. We 
put the barrier into the list initially just to the right of u p thus, 

Uo, * • • i tip j Ug, aj, a 2 , . . . , 

to indicate that u 0 , . . . , u„ are activated initially, if u 0 , . . . , u„ is not 
empty. But if u 0 , . . . , u„ is empty, we place the barrier initially thus, 

a„ | a i> a 2 » ■ • ■ > 

to indicate that in that case a 0 is activated initially. In each step by — >V 
or 3— », we shall use a new variable as the b. Suppose that thus far 

u 0 , u,„ a 0 a,_! or briefly t 0 , . . . , t, (q — p + i+ 1) have been 

activated, so the list looks so: 

Ug, . . . , Up, a 0 , . . . , a,_i | a„ a, ; 4^2 

As the b for the —»V or 3— », we then use a i; at the same time moving 
the barrier over to indicate that this variable is added to the part of the 
list which has been activated: 

Uq, . . . , Up, Uq, . * . , a,-_j, a, [ a, +1 , a ( -^. 2 , • ■ •• 

The steps along any particular path of the tree will be grouped in rounds. 

Say that at the beginning of a certain round, call it Round d, we have 
before us a sequent A — * A or more explicitly A,, . . . , A„, — > . . . , B„. 

(For d = 0, this is E A . — * Fj, . . . , F,. For d > 0, it is the sequent 

reached at the end of Round d— 1.) 

In carrying out Round d, we take each of the formula occurrences 
A u . . . , A m , Bj, . . . , B„ in turn (or more precisely, after the first step, a 
proper ancestral image of that formula occurrence), and perform if 
possible one step or a finite number of steps with it as principal formula 
(unless part way through the round we close the path, as explained below). 

- :iz The exclusion of variables occurring free in H,, . . . , E t — » F, Fj from also 

occurring bound is necessary. Cf. Exercise 49.2. 

- M Lor a sequent l£ l- 4 — » F,, . . . , K, containing no variables free or bound, 

and no individual symbols, i.e. a sequent of the propositional calculus simply, we can 

alternatively forget about the list u„ u,„ a„, a t , a, s '- 2 The following discussion, 

omitting what is then irrelevant, gives a new treatment of the completeness problem 
for the propositional calculus. 
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No step can be performed in case the one of Aj, . . . , A m , B 1( . . . , B n 
n question is an atom. 

In the case of a formula occurrence of one of the kinds — O , . . . , 

>ne step is performed, which is completely determined by the formula and 
he corresponding rule. 

In the case of a formula occurrence of one of the kinds — »V, 3—*, one 
tep is performed by the respective rule, with the next variable a* after 
hose previously activated as the b (as explained above). 

In the case of a formula occurrence of one of the kinds V— >, — >3, steps 
ire performed by the respective rule using as the r each one in turn of the 
erms t 0 , . . . , t„ already activated which has not previously served as 
he r for that rule with the same principal formula (or more precisely, 
vith a descendant image of it as the principal formula). This calls for 
rom 0 to q + 1 steps. 

A particular path will be terminated and closed exactly when a sequent 
s first reached of the form (x) with C prime, disregarding as usual the 
>rder of formulas within antecedent and within succedent. 

A path will be terminated without being closed exactly when a sequent is 
eached from which no step as prescribed is possible. This will happen at 
i given sequent (with t 0 , . . . , t 4 the terms thus far activated) exactly when 
he sequent contains only atoms, and V— >- and -»3-formulas descendant 
mages of which have already served as the principal formula with each 
>f t 0 , . . . , t Q as the r. (This can only happen at the end of a round, in which 
ase there is no next round.) 

We can summarize our search plan thus. We provide for a non-empty 
loinain D by activating u„, . . . , u p or a„ initially. In each round, we go 
hrough the double list of formula occurrences A u . . . , A m , B 1( . . . , B„ 
eached at the end of the preceding round (or given initially), analyzing 
ach molecule in turn, with respect to its outermost operator only (with 
he terms t„, . . . , t, ; thus far activated as the r’s for V — > and — >3), before 
tarting over at the beginning of the list in the next round. 

Lemma 8. In a sequent tree constructed upward from 

E k —* F„ ... , F, using the 14 rules and the described search plan, 

onsider any unclosed path , terminated or infinite. The list t 0 t„ or 

„ t„ t 2 , . . . of the terms which are eventually “ activated ” along the path 
> not empty, and includes all the individual parameters of 
ij, .... E t — » Fj, .... F, and all the terms used as b’j and t’s for 
-»V, V— >, — >3, 3 — * along the path, and hence every individual parameter of 
ny sequent along the path.* 3 * Each molecule occurring (in the antecedent or 

33J Actually (in this Case (A)), for an infinite path it is easily seen that X 0 terms 
, t„ t s , . . . must be activated, or we would run out of steps to perform at a finite stage. 
This remark isn’t necessary for our purposes, and it won’t always be so in Case (B) §50.) 
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the succedent) in any sequent along the path is used as principal formula (in 
the antecedent or succedent respectively) somewhere along the path, namely, 
just once, except in the case of an V — *- or —*3-molecuIe, which is so used 
with each of t 0 , . . . , t„ or t 0 , t,, t 2 , . . . as the r. 

Proof. For a molecule occurrence not of the kind V— » or — >3, an 
ancestral image of it will be used as principal formula during the next 
round after the one in which the formula first appears (or in the case of one 
of E t , . . . , E„, I ,, . . . , F„ in the first round). An V— »- or — »3-formula 
occurrence, once it has appeared, will never disappear. An ancestral 
image of it will be called up lor use as principal formula, with every 
activated term not yet used as the r, in every subsequent round (and in the 

first round, if it is one of E, E*, F,, . . . , F ( ), either ad infinitum or 

until the path is terminated. The path can be terminated only when all 
activated terms have served thus as the r. — 

We picked the particular search plan described above to get Lemma 8 
quickly, toward proving Godel’s completeness theorem. In practice, the 
search for a counterexample to E», .... E* -* F 1; .... F, can often be 
conducted more efficiently (without sacrificing Lemma 8) by allowing 
some variation from the above plan. Examples 1 and 2 take a little 
longer to complete under the above plan than in §48 (Exercise 49.1 (a) 
and (b)).- :15 

Llmma 9. In a sequent tree constructed upward from 
E t , , . . , E ; . — * Fj, . . . , F, using the 14 rules and the described search plan, 
corresponding to any unclosed path, terminated or infinite, there is a counter- 
example to E 1( . . . , E 4 — * Fd . . . , Fj with the domain D = {0, ... ,q) or 
D = {0, 1,2,...} according as only t 0 , ... ,t Q or t 0 , t t , t 2 , . . . are activated 
along the path. 

Proof. Let the sequents along such a path be A # — » A 0 , . . . , A, — > A, 
(terminated unclosed path) or A 0 — * A 0 , Aj — » A lt A 2 — » A 2 , . . . (infinite 

path), where A 0 — ► A„ is E x E fc — » F„ . . . , F t . Let UA be the set of 

all the formulas which occur in any of the antecedents A 0 , . . . , A t or A 0 , 

al3 In our search plan, when u„, . . . , u, is empty, we activate a 0 by fiat at the beginning 
to be sure of having it. But it is more economical to wait then for a 0 to be activated by 
the lirst — »V or 3 — >, when one of these can be reached before we need to perform an 
V— f or — >3 ; along any unclosed path in which we can’t do this, a„ should be activated 
by tiat (before termination). Steps may be performed in any convenient order for a 
while, provided that a routine like the one in our plan is instituted eventually along each 
infinite path. In applying — FV or 3— ► to a sequent in which one of the previously acti- 
vated variables does not occur, that variable may be used as the b instead of a newly 
activated variable. In applying V — > or — >3 when the A(x) does not contain the x free, 
it is sufficient to use only one r. Also cf. the “first approach” in Example 3 §48. 
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Aj, A 2 , . . . , and similarly let uA be the union of all the A’s. We shall 
show that, with the domain D as described, we can pick an assignment in 
D to make all the formulas in UA (including E„ . . . , E t ) t and all those 
in uA (including F„ . . . , F,) f. 236 

Because the path is not closed, UA and uA contain no atom C in 
common. For, suppose they did; say C first enters the antecedents in A 0 
and first enters the succedents in A fc . Once an atom has entered an 
antecedent or succedent, it rides up into all higher sequents in antecedent 
or succedent respectively in the extra formulas T or 0 of the steps. So C 
would be in both A,, and A c where c = max(a, b) (the greater of a and b, 
or their common value). So the path would have been terminated and 
closed (by use of ( x)) at A c — * A c (if not before). 

It will follow that, with the domain D as described in the lemma, we can 
pick an assignment to {at least) all the parameters of UA and UA which 
makes all atoms in uA t and all in UA f. As the parameters, we take (a) 
the variables and individual symbols t 0 , . . . , t„ or t 0 , t 1( t 2 , . . . , (b) the 
proposition symbols occurring in uA or UA, and (c) the other predicate 
symbols so occurring. We take D = (0, . . . , q} or D — {0, 1, 2, . . .} 
respectively, and we assign to t 0 , . . . , t„ the values 0 , ,q, or to t 0 , 
t 1( t 2 , . . . the values 0, 1, 2 To each of (b) we assign t if that prop- 

osition atom occurs in uA, f if it occurs inuA(we have seen that it 
can’t occur in both), and an arbitrary value say f if it occurs in neither. 

To each of (c) we assign the logical function I such that \{x 1 x„) is t if 

the predicate atom P(t, , . . . , t^) occurs in uA, f if P(t e , . . . , t x J occurs 
in UA (it can’t occur in both), and say f if it occurs in neither. 64 Each 
predicate atom in UA or UA is of the form Pft* , . . . , t x ) for some 
x„ . . . , x n belonging to D, using the second sentence of Lemma 8. 
Hence, all atoms in uA will now be t and all in UA will be f, as required. 

Finally, we show that the D and assignment picked to make all atoms in 
UA t and all in UA f also makes all molecules in uA t and all in uA f. 
Suppose it did not. Then, from among all the molecules in uA which are 
not t or in UA which are not f, let us choose one G containing the smallest 
number (> 1) of occurrences of operators. By the last sentence of Lemma 
8, G plays the role of principal formula (for the antecedent or succedent 
rule appropriate to its outermost operator, according as G is picked from 
uA or from uA). For G not an V — *- or — »3-formula, consider the side 
formula(s) H, or H and I, which are in the premise belonging to the path 
in question. By the choice of the 14 rules except V— * and — >3, the formula(s) 

"* For a terminated unclosed branch, it is sufficient (by the rationale of our searches, 
or specifically by Lemma 6 (b)) to make just the formulas in the top sequent have the 
right values (as illustrated in Example 1). However, often the best way to see that we can 
do this is by the reasoning we give here. 
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H, or H and 1, having the desired value(s) (t in the antecedent, f in the 
sucecdent) is sulTicient for G to have the desired value. (This property of 
the rules gave us Lemma 6 (b).) But H, or H and I, having fewer occurrences 
of operators than G, do have the desired value(s), since G was supposed 
to have the minimum number of operator occurrences for formulas not 
having the desired values. So such a G cannot exist, unless it is an V— ♦- or 
—♦3-formula, i.e. VxA(x) in the antecedent or 3xA(x) in the succedent. 
Then by end Lemma 8, A(r) occurs as side formula for each of t 0 , . . . , t, 
or of t 0 , L, t 2 , . . . as the r. But under our assignment these variables name 
all the members of the domain D. So G must get the desired value, in 
consequence of all the side formulas A(t 0 ), . . . , A(t,) or A(t 0 ), A(ti), 
A(t 2 ), . . . getting the desired values (since they each have one less operator 
occurrence), contradicting the choice of G. — 

In the following lemma dealing with geometric trees, we mean by a 
partial path a succession of vertices connected by following the arrows, 
starting with the initial vertex V. (By a path, ,we have meant such a 
succession of vertices continued either to termination or ad infinitum.) 

Lfmma 10. (Konig’s lemma 1926.) 237 In a geometric tree with finitely 
many arrows leading from each vertex, if there are arbitrarily long finite 
partial paths, then there is an infinite path. 

Proof and illustration. In the application we are to make, the 
number of arrows leading from a vertex can be 0, 1 or 2. Very simple 
geometric trees of this kind are shown in Examples 1-4; here is a more 
complicated such tree (drawn horizontally to save space): 



To give the proof in general, suppose we are dealing with a tree as 
described in which there are arbitrarily long finite partial paths. We wish 

- 1 ' In other forms this is in Brouwer 1924. and implicitly in Brouwer 1923a and Skolem 
1922-3 p. 222. (T. K'.eene and Veslcy 1965 p. 59. 
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to trace an infinite path. Here is the rule for doing so. Suppose we have 
traced the desired path as far as vertex V x (either the initial vertex V, 
or a later vertex such as K uoo in the illustration), and that V x belongs to 
arbitrarily long finite partial paths (as does V by hypothesis). We want to 
pick a next vertex that will also have the italicized property. There must be 
a next vertex after V x , or V x could not have the property; say the next 
vertices are V xo , .... V Xn . One at least of these next vertices must have 
the property; for if the partial paths through V xo , . . . , V Xn were no 
longer than . . . , b n vertices respectively, then all partial paths through 
V x would be no longer than max(Z> 0 , . . . , b n ) vertices. So we can indeed 
pick a next vertex after V x also having the property. Altogether, starting 
with V, we are thus able to pick a next vertex, always with the property, 
ad infinitum. 

In the illustration above, if the boldface vertices are the ones which 
belong to arbitrarily long finite partial paths, then one infinite path 
(boldface arrows) starts out f / K 1 f / n t / 1 , u K UO0 k' 110l)1 • • •• 

Example 6. For trees in which infinitely many arrows may lead from 
a vertex, the lemma need not hold. Consider the tree in which X 0 arrows 
each lead from V to a next vertex V x ( x = 0, 1,2,.. .); and from V x 
successive arrows lead to x more vertices V xW , . . . , thus : 


V Vo 



In this tree, there arc arbitrarily long finite partial paths, but no infinite 
path. — 

Now, given a sequent E t , .... E* — > F 1( ... , F ( as specified for Case 
(A), let us start constructing the sequent tree upward from it, using the 14 
rules and the described search plan. The succession of steps along any 
path is governed by the search plan. Let us divide our work between the 
different paths, when there is branching, so that different paths are 
constructed up to corresponding levels in step. Thus, after having con- 
structed up to their 10th vertices all partial paths which don't terminate 
sooner, we take each of these which doesn’t terminate at its 10th vertex 
and add the one or two 1 1th vertices issuing from it, before we put a 12th 
\ertcx on any path. 
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CASii 1 ; for some b, each path terminates and closes with not more 
than b vertices. Then the tree itself is closed, and finite (it has at most 
1 + 2 + 2' J + . . . + 2 6-1 = 2 b — 1 vertices). So there is no counter- 
example, i.e. 1= E x , . . . , E* — ► F x , . . . , F|. (To review the argument, 
already illustrated by Example 2, at each partial stage in the tree con- 
struction, our sole hope of getting a counterexample is that one of the 
seq cents that stand at the tops of branches can be falsified; indeed, we 
know this by repeated applications of Lemma 6 (a). But with the whole 
tree closed, each sequent at the top is not falsifiable, by Lemma 7.) 

Case 2: for some h, some path terminates unclosed at its />tli vertex. 

Then by Lemma 9, there is a counterexample to E x , . . . , — > F 1; , . . , F, 

with D = (0, . . . , q}. (Along a terminated path, only finitely many terms 
t 0 , . . . , t„ can be activated in this Case (A).) 

Casi: 3; otherwise. Then the tree has arbitrarily long finite partial 
paths. So by Konig’s lemma (Lemma 10), there is an infinite path. 

Hence by Lemma 9, there is a counterexample with D = (0 q) or 

D = {0, I, 2, . . 

We have now' reached our goal for this section. To simplify the statement 
(italicized next below), wc can take D — (0, 1, 2, . . ,} in both Cases 2 
and 3. For, if there is a finite counterexample, we can manufacture a 
countably infinite one from it by splitting an element into elements, all 
behaving like the original clement in the evaluation process. (We used 
this idea, splitting an element into 2, in §§ 30, 48.) 

Preliminary version of Theorems 33 and 34°. For a sequent 
E,, . . . , E k — * F„ . . . , F, as described for Case (A) ( containing no variables 
both free and bound): Either (1) there is a counterexample to 
Ej, . . . , Ej. — * F x , . . . , F, in the domain {0, I, 2, . . .}, or (II) there is a 
( finite ) closed sequent tree constructed upward from 
Ei, ... , E t — > F\, . . . , F, by the 14 rules, and hence there is no counter- 
example to E 1; . . . , E t -* F lf . . . , F„ i.e. k E„ . . . , E t — » F„ . . . , F,. 

It follows as a side result that E t , ...» F x , ...» F, cannot have 
uneountably infinite counterexamples and no others: 

Preliminary ve:rsion of Theorem 35. For a sequent 
Ei, ... , I v, — > Eq, . . . , F, as described for Case (A): If there is any counter- 
example to El . . . , E k — f Fj, . . . , F„ then there is one in the domain 
{0,1,2,...]. 

Exercises. 49.1. Construct the sequent tree upward (using the 
described search plan), and conclude (I) (give the counterexample) or (II): 

(a) -» 3x(P D Q(x)) 3 (P 3 VxQ(x)) (Example 1). 

(b) -> 3x(P D Q(x)) D(P3 3xQ(x)) (Example 2). 
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(c) -» “1(0 & ~ iQ & Vx3yP(x, y)) (Example 5). 

(d) — > P & 3x(Q(x) D Q(x)). (e) 3xP -> VxP. 

;f) Vx3yP(x, y) -> Vx3yP(x, y). (g)PDQ-»PD Q. 

49.2. Apply the search plan to VaVc(P(c) & Q(a)) — * Q(b) and 
\/aVb(P(b) & Q(a)) —* Q(b). Why does it fail for the latter? 

49.3*. Consider making a systematic search for a counterexample to 
-i{Vx~iP(x, x) & VxVyVz(P(x, y) & P(y, z) D P(x, z)) & Vx3yP(x, y)} (Ex- 
imple 4 § 48), by using the 14 rules upward with the following search 
alan (different from the plan in the text): in Round 0, perform all steps 
aossible with only a„ activated; in Round 1, activate a x by 3 — * and then 
perform all new steps possible with only a 0 , a! activated; in Round 2, 
ictivate a 2 by 3— > and then perform all new steps possible with only 
t„, a„ a 2 activated ; etc. 

a) Show that Lemma 8 is satisfied by this search plan. 

b) Show that, if the steps are continued to the end of Round d suspending 
he rule for closing paths, there will be 3(r/+l) 3 “branches” or partial 
Daths. (Thus, activating only a 0 , a l5 a. as in the informal discussion in 
Example 4, there will be 81 paths, if the possibilities for closing some of 
hose paths before completing all the disections are disregarded.) 

c) Using as a guide the counterexample which we found by insight in 
Example 4, pick at each branching the sequent along a path which goes 
vith that counterexample. Write the sequent which will be reached at the 
■nd of Round 1 along the resulting infinite path. 

§ 50. Godel’s completeness theorem with a Gentzen-type formal 
ystem, the Lowenheim-Skolem theorem. Now we would have 
he first case of Godel’s 1930 completeness theorem (lf¥ F, then F F), if 
nder (II) just above (for k = 0, / = 1) we could infer that V F in the 
ense of § 21 . (We shall do so in § 51 .) 

However, we already have the basic discovery. This is that, whenever 
o counterexample to F exists (i.e. when f= F), that fact can be confirmed by 
finite mechanical verification process. 238 This process (in the present 
eatment) consists in verifying that a certain finite figure meets the 
jnditions for being a closed sequent tree constructed upward from — * F 
sing (he 14 rules. 

From the standpoint from which formal systems and proof theory were 
ivented (§ 37), this fundamentally is all we were after. We could take the 
icchanical verification process as itself constituting a proof of F. 

In fact, these verifications come under essentially the traditional form 

aM Using a different mechanical verification process than the present one, this basic 
scovery is quite clearly contained in Skoleni 1922-3 pp. 220-222 (which was not known 
Godel in 1930). Cf. Footnote 252 below. 
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for axiomatic-deductive systems, if we simply read downward in checking 
the correctness of the trees, instead of upward. At the top of each branch, 
if the tree is closed, we have a sequent fitting ( x) in § 48, which we can 
now consider to be an axiom schema. (Now “x” instead of meaning 
“closed” can mean “axiom".) And each step downward is by one of 14 
rules (previously read upward), which we can now regard as one- or two- 
premise rules of inference. (Now “D — >” can be called “D-introduction in 
the antecedent”, etc.) 

Now any tree with axioms by (x) at the tops of branches, and each 
step downward by one of the 14 rules, constitutes a proof (oj its bottom 
sequent or endsequent) in a formal system G4 of a new type, called a 
Gentzen-tvpe ( sequent ) system or sequent calculus. 239 Such systems were 
introduced by Gentzen 1934—5 (and 1932), in part following Hertz 1929. 
In contrast, we call the formal system for the predicate calculus of §21 
a liilberl-tvpe system H. More precisely, “G4” and "H" denote ambig- 
uously several systems, according to how the notions of term and 
formula (specifically, of prime formula) are regulated (cf. §§ 37, 39). 

By Lemmas 6 (a) and 7, G4 has the consistency property analogous to 
Theorem 12, M for //: 

Theorem 33. Each sequent A^ .... A m —► Bj, , B„ provable in 
<74 is valid ; in symbols, if P A„ . . . , A m — » B^ . . . , B n , then 
r Aj, . . . ,A,„ * B|, . . . , B„. 

This is only a restatement of the feature of our searches that closure of 
the tree means that all possibilities for finding a counterexample have 
been closed out. Without this consistency property, we would hardly want 
to use G4 as a formal system. 

Lemma 6 (b) expresses a novel property of (the rules in) G4, not 
possessed in H by the rule of modus ponens. 240 

The fact that G4 operates with sequents, so — > F (rather than F) is 
provable in G4 when the search procedure closes, is not a serious drawback. 
Anyone who objects to this could easily modify G4 to use formulas as 
did Schiitte 1950 (though the sequents are convenient), or he could 
supplement G4 by a rule — > F / F. 

Also G4 has the new feature that its proofs are finite trees (they are 
“proofs in tree form”) rather than finite (linear) sequences of formulas 
(“proofs in sequence form”). We could rewrite the trees in sequence form. 

■ n Ge. 1 t/.en-typc systems Cl , G2, G3, (73a are discussed in IM §§ 77-80, which wc shall 
cite more specifically in § 54. The present system G4 is very close to C 3, to the system L 
of Beth 1459 p. 282, and to LC in Ranger 1957. The first 8 rules coincide with the 
propositional rules of Ketonen 1944. 

G3 has the feature, but not Gl, G2, G3a. 
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But the trees show the logical structure better, and thus help us in our 
reasoning about that structure. The linear arrangement of proofs has 
been traditional, no doubt because oral language is necessarily linear, 
and written language more conveniently so for ordinary purposes. Proofs 
(and deductions) in H can be written in tree form. 241 

Now we take up some other cases of Godel’s completeness theorem. 

We begin (Case (B)) by extending the treatment of Case (A) to deal 
with a countable infinity of formulas . . . , E 2 , E 1; E 0 to be made t and a 
countable infinity F„, F lt F 2 , . . . to be made f, simultaneously, in which 
altogether there occur only a finite list u 0 , . . . , u, (possibly empty) of 
free variables and individual symbols (but no other function symbols). 
Such free variables shall not occur bound in any of the formulas. 
Alternatively, one of the two lists of formulas may be finite or even 
empty. 242 For those alternatives, slight alterations of our notation are to 
be imagined. 243 

To deal with this case, we generalize our notion of sequents and 
related notions to allow N 0 formulas in the antecedent and succedent (or 
in one of them). We now work with the X 0 -sequent 
. . . , E 2 , Ej, E 0 — > F 0 , F u F 2 , . . . instead of with the sequent 
Ei> Fj, ... , Fj. 244 

However, during Round d along any particular path in the tree only the 
first d + 1 formulas of the original lists will have been activated. Say that 


• ■ - > E(Z+2> E<i | * * ■ ) ^ii 


®n I Fd+1> F<i+8> • • ■ 


2)1 Cf. IM pp. 106-107. In IM “branch” is used where we have been saying “path”, 
with both Hilbert-type and Gentzen-type systems. (Contrary to botany, a “branch” 
starts from, or reaches to, the bottom, not just from the lowest “branching”.) The term 
in Hilbert and Bernays 1934 is “proof thread (Beweisfaden)”. 

212 One of the interesting cases is that . . . , E 2 , E„ E„ are the closures of the axioms of a 
formal system like N of § 38, and instead of F 0 , Fi, F 2 , . . . we have a single formula F. 
We shall review this application at the end of § 52. However, it is not required for the 
aresent treatment that the formulas . . . , E 2 , E,, E 0 (or F 0 , F lt F 2 , . . .) be given effectively, 
is the axioms of a formal system should be (cf. §§ 37, 43). (For the result mentioned in 
Footnote 228 with infinitely many formulas, the lists must be effective, so that E, 
and F, can be found effectively from /.) 

243 In this book we do not attempt to consider languages for the predicate calculus 
•vith uncountably many symbols, and hence uncountably many formulas. Such languages 
ire highly non-constructive. However, Malcev 1936, Henkin 1950, A. Robinson 1951, 
:tc. have studied them, and found model-theoretic applications. Propositional calculus 
with uncountably many symbols is allowed in Godel 1931— 2b. 

241 The reader who skipped the details of the search plan in Case (A) should observe 
}ne new point (after which he can pass to Cases (C) and (D)): If the tree is closed, and 
hus finite, then only finitely many of . . . , E a> E,, E 0 and F 0 , F,, F 2 , . . . (say at most 
£j, . . . , E 0 and F 0 , . . . , FJ can have been taken into account in constructing it by 
he 14 rules and recognizing it as closed by the axiom schema (x). 
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is the X 0 -sequcnt reached at the end of Round cl— 1 (or when cl = 0, the 
initial X 0 -sequent, with A t , . . . , A m and B lt . . . , B„ both empty). Then 
we commence Round cl by first moving out the two barriers in the X 0 - 
sequent before us to activate E (i and F„, thus, 

■ • • > | A i> • ■ • » ^ ^i» * * * * F^ [ f-i» F rfli , .... 

Then each of E,„ A t , . . . , A,„, B l5 . . . , B,„ F rf in turn is used to the 
extent possible as principal formula (as A„ . . . , A,„, B x , . . . , B„ were 
in Case (A)). The criterion for closing a path js that the part of the 
N 0 -sequent between the barriers (which is a sequent) fit the axiom schema 
( x), i.e. have an atom common to its antecedent and succedent. After any 
round with the path not closed, there is always a next round in which at 
least the barriers are moved one position out. If the formulas E ( , and F d 
newly activated are atoms, it can happen that there are no steps in that 
round (i.e. no sequent(s) are added to the tree); namely, this will happen 
if the sequent previously between the barriers qualified for termination 
under Case (A). If, in this case, alt the formulas outside the barriers are 
atoms, then termination without closure will take place by having an 
infinity of rounds happen “instantaneously” (the barriers being simply 
moved out, one position after another). This is the only way termination 
without closure can take place. Closure, if it takes place, occurs in some 
finite round. 

No change is required in Lemmas 8 and 9 other than replacing 
“sequent” by “X 0 -sequent” and “E x , . . . , E fc — ► F x , . . . , F,” by 

. . , E 2 , Ej, E 0 — » F 0 , Fj, F 2 , . . .” (and Lemma 10 does not depend on the 
case). But in conclusion, if we obtain a closed X 0 -sequent tree, then from it 
we can obtain a closed sequent tree as follows. Since the X 0 -sequent 
tree is closed, it is finite (as argued in Case 1 end § 49). Consider its 
finitely many top vertices, and let d be the greatest round during which 
any of them become closed. Then nowhere in the tree do the two barriers 
stand further out than just right of (an ancestral image of) E^ in ante- 
cedent and just left of F rf;1 in succedent. The pairs of atoms C in top 
X 0 -sequents which occasion their closure are within the barriers. So are 
the principal and side formulas in each step. Now let us move the barriers 
out in every X, r scquent up to but not across E, ;i t and F (<l t , and then throw 
away ail formulas still outside the barriers. The result is a closed sequent 
tree with L, ( , . . . , E 0 — * F„, . . . , F rf at the bottom. 

Finally, we take the cases not already treated of finitely many formulas 
E t , . . . , E a , ,F„ . . . , F, (Case (C)), or X 0 formulas . . . , E 2 , E 1( E 0 , F 0 , 
Fj, F,, . . . (Case (D)), which in the aggregate contain (i) finitely or 
infinitely many free variables (which shall not also occur in any of the 
formulas bound) and individual symbols and (ii) finitely or infinitely many 
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other function symbols. 245 We write out the treatment supposing there 
are at least one each of the symbols (i) and (ii). Otherwise, slight modifica- 
tions are to be imagined. 246 

To deal with these cases, we prepare X 0 separate lists of X 0 terms each, 
which we show with the initial position of the barrier in each: 

u 0 I Uj, u 2 , u 3 , . . . , 

| a 0 , u 01 , u 02 , u 03 , . . . (where a 0 is u 00 ), 

| a x , Uh, u 12 , u 13 , . . . (where a x is u 10 ), 

| a 2 , u 21 , u 22 , u 23 , . . . (where a 2 is u 20 ), 

The first list u 0 , u x , u 2 , . . . is an enumeration of all the terms (§§ 28, 38, 39) 
constructive using the symbols (i) and (ii). We can always get an 
enumeration of the terms thus constructible, e.g. by the method of digits 
((A) in § 32). For example, if the formulas contain no free variables but 
exactly one individual symbol e and two other function symbols f( — ) and 
g( — , — ), the enumeration could start out thus: 

e, f(e), g(e, e), f(f(e)), f(g(e, e)), g(e, f(e)), g(f(e), e), . . .. 

Each subsequent list u i0 , u n , u, a , ... is an enumeration of the additional 
terms which are constructible when the variable a, is also made available. 
Thus u, 0 , u a , u, 2 , ... is an enumeration of the terms constructible 
allowing the use of the same symbols as for u„, u 1( u 2 , . . . and the variables 
a 0 , . . . , a, and actually using a,- (so we don’t include any terms from 
earlier lists). For example, if the symbols for u 0 , u l( u 2 , . . . are as above, 
u 10 , u n , u 12 , . . . could start out 

* 1 , f(a x ), g(a 0 , a x ), g(a x , a 0 ), g(a x , a^, f(f(aj)), f(g(a 0 , a x )), . . .. 

When a step by — »V or 3— » is performed, we use as the b the first one 
a, of a 0 , a x , a 2 , . . . not yet activated, and we move the barrier over one 
place in the list u i0 , u a , u i2 , . . . which it begins to record this. In V— »- 
and — »3-steps, all the terms t 0 , . . . , t f thus far activated in any of the 
separate lists are available as the r. The rounds are managed as under 
Case (A) or (B), except that, after the end of any round at the commence- 
ment of the next, we move the barrier one place to the right in each of 
the lists of terms in which the barrier is not at the extreme left. Thus, 

1,5 The reader who has been skipping the details of the search plan may pass to the 
remarks on Lemmas 8 and 9. 

If there are none of (ii), the list a„ u,,, u, 2 , u, a , . . . reduces to a, (i = 0, 1, 2, . . .); 
then there must be x 0 of the symbols (i), or we would be back in Case (A) or (B). If 
there are some of (ii) but none of (i), the list u„, u,, u 2 , . . . disappears, and a 0 should be 
activated initially. 
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along any path in the tree which does not close, once any term in a list 
is activated, we eventually activate every term in the list; they all go into 
the consolidated list t„, t„ t,, . . . of all the terms eventually activated. 
This will happen even in a terminated but unclosed path, by infinitely 
many rounds being performed “instantaneously” by just moving barriers 
o\er in the lists of terms and for Case (D) also in the lists of 
formulas. 

The second sentence of Lemma 8 now reads: The list t 0 , t t , t s , . . . of 
the terms which are activated along the path is an enumeration of all the 
terms eonstructih/e using the individual and other function parameters of 
E,, . . . , E fc — » F,, . . . , F, or of . . . , E a , E t , E 0 — » F„, F,, F a , . . . and those 
of the new variables a n , a 1( a 4 , . . . which are introduced in using — »V and 3— »; 
every r for V — * ot — >3 is taken from the list', and hence every term occurring 
free (§ 28) in any sequent along the path is in the list. 

For Lemma 9, we shall arrange matters so that each of the activated 
terms l 0 , t l5 1,, . . . stands for a different member of D. Thereby we will be 
free to make all predicate atoms P(tj. , .... t, ) in UA t and all in uA f. 
(We only aim to describe some counterexample, as quickly as we can.) 

So we take D — { 0, 1, 2, . . and we shall make the assignment to the 
parameters in the terms so that the values of t 0 , t 1; t 2 , . . . are precisely 
0, 1, 2 

To accomplish this, first consider each one e of the free variables or 
individual symbols; it occurs just once in the list t 0 , t 1; t a , . . . , say as t,. 
We give e the value/. Now consider each one of the other function symbols; 
a 1-place function symbol f( — ) will serve for illustration. Each of the 
terms f(t 0 ), f^), f(t 4 ), . . . occurs just once in the list t 0 , t t , t 2 , . . . ; say 
they occur as t, , t, , t f , . . .. Then we evaluate f( — ) by the function / 
such that /(()) = /„,/( 1) = /*!, /( 2) = i 2 

Now' consider the process of evaluating a term (§28). For a given 
assignment, we evaluate its term parts successively from inside out, 
starting with assigned values of variables and individual symbols, and 
applying assigned values of functions. But under the assignment just 
described, at each successive stage (including the last one) the term part t 
in question is evaluated by the number / such that t is t r 

The choice of truth values and logical functions to evaluate the propo- 
sition and predicate symbols can now read as before. 

For the proof that all molecules in uA must then be t and all in UA f, 
we review only the case of an V— : ►-formula (that of an — »3-formula is 
similar). Say e.g. G is the V— ^-formula Vx(P(f(x)) & Q) in uA and gets 
the wrong value f, while all 1 -operator (and 0-operator) formulas in uA 
and uA get the right values. But then by Lemma 8 all of P(f(t 0 )) & Q, 
P(f(tj)) & Q, P(f(t 2 )) & Q, . . . will be in UA as side formulas to G, and so 
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have the right values, namely all t. Then Vx(P(f(x)) & Q) is t (contra- 
dicting our assumption); for, consider any x. When x has the value x, the 
value of f(x) is the value of f(tj, so the value of P(f(x)) & Q is that of 
P(f(tJ) & Q, which is t. This is for any x\ so the supplementary table for 
P(f(x)) & Q has a solid column of t’s; so Vx(P(f(x)) & Q) is t. 

Taking the main conclusion we reached at the end of §49, restating it 
using the idea of a proof in <74 (beginning this section), and adding the 
new cases (B)-(D), we have the following theorem. 


Theorem 34°. (Godel’s completeness theorem with a Gentzen-type 

formal system (74.) Let j^ 1 ’ ‘ ' ' ’ p*’ p 1 ’ ‘ ‘ ’ ’ j be formulas of the 

1 > *-'U> * 1> * 2» • • • j 

predicate calculus containing bound no variable which occurs in any of 

them free. Either (I) j^i* • ■ Ey E x , ‘ ‘ ‘ ) is falsifiable in 

J |. . . , E 2 , El E 0 -> F 0 , F„ F 2 , . . ./ 

the domain of the natural numbers {0, 1, 2, . . .} (“N 0 -falsifiable”), or 

m\ (Eii • • • > E* F x F, 


for some d, E d , , . . , E 0 —* F 0 


In Cases (A) and (C) (the upper version in Theorem 34), when (II) holds, 

Ej E k — » F„ . . . , F, is not falsifiable. (This part of our conclusion 

in §49 has meanwhile been stated separately as Theorem 33.) Hence, if 
Ej, . . . , E* — * F x , . . . , F, is falsifiable, it is falsifiable in {0, 1, 2, . . .}. 

Using this for J — 0, we have the (a) part of the next theorem. For, the 
sequent E„ . . . , E k — » is falsifiable in a given domain exactly if the 
formulas E x , . . . , E* are simultaneously satisfiable in that domain. 

In Theorem 35, we do not need to keep the condition that the formulas 
contain bound no variable which occurs free in any of them. For, other- 
wise we could first replace the given formulas by formulas congruent to 
them (§ 16) and satisfying the condition, without changing their truth 
tables for any domain and assignment. Then, after applying the theorem 
with the condition met, we could change back to the original formulas. 


Theorem 35. (The Lowenheim-Skolem theorem.) 247 In the predicate 
calculus: 

(a) (After Lowenheim 1915, Skolem 1920.) If E is satisfiable, it is 
^-satisfiable. If E x , . . . , E k are simultaneously satisfiable, they are simul- 
taneously H 0 -satisfiable. 


247 These versions (except “compactness” in (b)), for Cases (A) and (B), can be 
established by the reasoning in Skolem 1920 plus the trivial argument (end § 49) that, 
if we come out with a finite domain, an element can be split into x 0 elements. More 
details of the history are in Footnote 252 below. 
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(b) (After Skolem 1920.) If E 0 , E x , E 2 , . . . are simultaneously satisfiable 
f or even if for each d, E 0 , . . . , E^ are simultaneously satisfiable (“com- 
pactness”, Godel 1930)], then E 0 , E x , E 2 , . . . are simultaneously K 0 - 
satisfiable. 

Prooh of (b), supposing E„, E„ E.>, . . . contain bound no variable 
which occurs free in any of them. We shall apply Theorem 33, and the 
lower version of Theorem 34 omitting F () , F,, F 2 , . . .. 

Suppose that, for each d, E E ( , are simultaneously satisfiable. 

This means that, for each d, there is a respective domain D d and assignment 
in If which makes the formulas E,„ . . . , E d all t, and thus makes the 
sequent E,„ . . . , E 0 — * f, so that not t= E^, . . . , E 0 — >, and hence by 
Theorem 33 not h E,„ . . . , E 0 — > in 04. Since this is for every cl, (II) of 
Theorem 34 is excluded. So by the remaining alternative (I), 

. . . , E 2 , Ei, E„ — » is falsifiable in (0, 1,2,.. .}, i.e. E 0 , E x , E 2 , . . . are simul- 
taneously satisfiable in {0, 1,2, . . .}, or briefly they are simultaneously 
X„-satisfiablc. 

Exercises. 50.1. (a) Establish the following for m, n > 0: 

(1= A„ . . . , A,„ -» B„ . . . , B„}^ {h -> A, & . . . & A m 3 B x V . . . V B„}, 

Aj, . . . , A,„ -* B x B,,} A x &...&A m DB 1 V.„V B„}, 

assuming for the fourth implication that no variable occurs both free and 
bound in An ... , A m -» B x , . . . , B n . State and prove like propositions 
(b) for m — 0 & n > 0 and (c) for m > 0 & n— 0. 

50.2. Establish the following additional versions of the Lowenheim- 
Skolem theorem. IfE u . . . , E t . — * F 1; . . . , F ; or 
. . . , n 2 , E„ L„ — > F 0 , F„ F 2 , . . . is falsifiable , it is X 0 -fa/sifable. 

If X„-!= F, then 1= F. If E x , . . . ,E X . X 0 -f F, then E X) . . . , E* h F. 

§ 51. Godel’s completeness theorem (with a Hilbert-type formal 
system). To obtain Godel’s completeness theorem in the form “ If 1= F, 
then b F”, it remains for us to establish “If I — » F in (74, then L F in H". 
We get this in Corollary Theorem 36, by a lengthy but straightforward 
exercise in the proof theory of H. 

Tiii ori m 36. If f A„ . . . , A m — * B„ . . . , B„ in the predicate calculus 
(74, then A,, . . . , A,,,, — iB,, .... nB„ L P & ~tP in the predicate calculus H. 

Li MM A 1 1. (a) In II, for n > 0, 

{Aj, . . . , A m , -iB x , "iB,, I- P & ~iP}5±° 

{A x A w , — iB x , . . . , ~>B m _i h B„}. 

(b) In II. fur ni > 0, 

{A x A„ , ~iB„ .... — iB, f P & ~ iP} 

{A.,, . . . , A,„, — iB,, . . . , ~iB n L — iAJ. 
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Proof of lemma. The four implications follow respectively from (1) 
-i-introd. and (double) - i-elim., (2) weak -i-elim., (3) — i-introd., (4) weak 
-l-elim. (cf. Theorem 13 § II). We do (1) in detail using the format 
(A) § 13 (employing Theorem 9 tacitly). 


1. 

A„. 

A 

• » 

iB 1( . 

• - “iB„ 

b 

P 

& — iP — by hypothesis. 

2. 

A„. 

• ’ 

~ 'Bj, . 

• . "" 'B„ 

b 

P 

— &-elim., 1. 

3. 

A„. 

A 

• » r *m* 

-iB„ . 

• . "iB,, 

b 

— iP — &-e!im., 1. 

4. 

A„ . 

• * ^ m 

— >B„ . 

• , "~ 1 B„ 

-l 

b 

— i — iB„ — — i-introd. 2, 3. 

5. 

A„. 

A 

* » 

-iB„ . 

• , ~iB„. 

-l 

b 

B„ — -i-elim., 4. 


Proof of theorem. Suppose given a proof of A„ . . . , A ,,,-* B lt . . . , B„ 
in G4. Such a proof is in tree form (§ 50). We shall show that, starting at 
the tops of branches and working down step by step, we can establish, for 
each sequent A — > A in its turn, that A, ~iA bP& — iP, where if A is 
L lt . . . , L, then ~iA is — iL t , . . . , - iL, (and if A is empty, so is ~iA). 

We have 15 cases to consider, according as the sequent A — * A we are 
considering is at the top of a branch (in which case it is an axiom by ( x )) 
or results by a downward step (an “inference”) by one of the 14 rules. We 
give 5 cases, leaving the others to the reader (Exercise 51.1). 

Case (x): A— » A is an axiom, i.e. is of the form C, T — » 0, C. 

So “A, -iA I- P & -iP” is “C, T, -10, “iC b P & “iP”, which is true by 
weak ~i-elim., or using Lemma 1 1 (a) and Theorem 9 (i), 

Case — O : A — > A (of the form T — ► 0, A 3 B) comes from — > Aj 
(of the form A, F — » 0, B) by — >Z). We are establishing the desired 
property of the sequents in the tree by working downward step by step. 
So we do not have to deal with this sequent A — * A (by establishing 
“A, ~iA b P & -iP”, i.e. ‘T, -10, ~i(A D B) I- P & “iP”) until we have 
already handled A 1 — >A t (by establishing “A 1( — ]A t h P & “ iP”, i.e. 
“A, I\ “10, -iB b P & — iP”)^ By Lemma 11 (a), our problem reduces to 
inferring “i\ — 10 1 A 3 B” from “A, F, -|0 b B”. This is immediate by 
the deduction theorem (Theorem 11, or U-introd. in Theorem 13). 

Case o — >. From (a) T, -10, -iA b P & -iP and (P) B, T, 
n(-) b P & — jP, we need to infer A D B, F, “10 b P & "iP. Use V-elim. 
and *59. 

Case — »V. Using Lemma 1 1 (a), we aim to infer T, “10 b VxA(x) 
from (a) F, -10 b A(b). Since F, — 10 do not contain b free (by the restric- 
tion on variables for the rule — »V), we can apply V-introd. (Theorem 
21 § 23) to (a) to infer (P) F, -|0 b VbA(b). By the stipulations following 
the rules in § 48, the hypotheses of Lemma 5 § 24 are satisfied, so by *73 
b VxA(x) ~ VbA(b). This with ((3) gives F, — 10 b VxA(x). 

Case. — >3. We must show that, if (a) F, — 10, — i i3xA(x), — iA(r) b 
P & — iP, then F, — 10, — i3xA(x) b P & — iP. From the 3-schema 
A(r) D 3xA(x) by contraposition (*1 2 §24) and Z3-elim., — i3xA(x) b — iA(r). 
Using this with (a) and Theorem 9, T, — 10, — i3xA(x) b P & -iP. 
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Corollary, (a) If b A t , . . . , A m — » B, then A,, . . . , A,„ b B. 

(b) If b A„ . . . , A,„ -> B 1; .... B„, then 
1- A x & . . . & A,„ D BjV. . .V B, (m, n > 0).- JH 

(e) If b -» Bj, . . . , B,„ //jen b B, V . . . V B„ (n > 0). 

(d) If I- Aj, . . . , A,„ then L -lfAj & ... & A J (/« > 0). 

Proof. From the theorem using Lemma 1 1, &— » and — »V, etc. — 

Using Theorem 36 and Corollary in (II) of Theorem 34, we obtain 
statements about the Hilbert-type system H. We give some other conven- 
ient versions of Godel’s completeness theorem with a Hilbert-type system 
in Theorem 37 below. 

Gcnerali/.ing the notion “E 1; . . . , E,,. b F” (§ 20), we say that F is a valid 
consequence of E„, E 1 , E 2 , . . . ( holding all variables constant), or in symbols 
E„, E„ H 2 , . . . b F, if, for each domain D, F is t for all assignments which 
make all of E„, E 2 , . . . t. Also, replacing “for each domain £>” by “for 

the domain D — {0, 1,2,.. we get the like with “N 0 -b” in place of 
“b” (and similarly with “/>b” for any other particular nonempty D). 

Generalizing the notion “E„ . . . , E k h F”, we say F is deducible from 
E 0 , E lt E a , . . . ( holding all variables constant ), or in symbols 
E 0 , E b E a , . . . h F, if there is a deduction of F from E 0 , E 1; E 2 , . . . (holding 
all variables constant) defined as before (§ 21), except with the infinite list of 
formulas E 0 , E 1; E 2 , . . . available for use as assumption formulas instead of 
only a finite list. But only finitely many of E 0 , Ej, E 2 , . . . can be used in a 
given deduction; so the notion “E 0 , E 2 , E 2 , . . . h F” as just defined is 
immediately equivalent to “for some d, E 0 , . . . , E a b F”. 

Clearly, “for some d, E 0 , . . . , E d b F” implies “E 0 , Ej, E.,, . . . b F”. The 
converse is not immediate (as it was with “b”); but it is given by Godel’s 
completeness theorem. 

Thlorem 37°. (Godel’s completeness theorem 1930.) In the predicate 
calculus //: 

(a) If b F [or even if N 0 -f= F], then b F. If E 1( . . . , E* b F [or even if 
Ei, ... , E fc K 0 -b F], then E^ . . . , E* b F. 

(b) If E„, E 1? E b F [or even if E 0 , E^ E 2 , . . . X 0 -b F], then, for some 

d, E,„ . . . , E,, b F, and hence E 0 , Ej, E 2 , . . . b F. 

(c) Either E x , . . . , E k are simultaneously H 0 -satisfiable, or 
b -i(E,& ...&E k )(k> 0). 

(d) Either E 0 , E x , E 2 , . . . are simultaneously H 0 -satisfiab/e, or, for some d, 
b — i(E 0 & . . . & EJ. 

Proofs. As with Theorem 35, it will suffice to prove each part under 

2J,i If we interpret “A, & ... & A„,” for in = 0 to be — i(P & — IP) (or P D P), and 
“B, V ... V B„” for ,i = 0 to be P & “IP (or — I(P O P)), then (b) holds without the 
restriction to in, n > 0. 
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the supposition that no formula contains bound any variable occurring 
free in any of the formulas. For, otherwise we could first replace the given 
formulas by ones congruent to them (§ 16) and meeting this condition, 
without affecting the holding of the model-theoretic hypothesis, and at the 
end use Theorem 25 and Corollary 2 Theorem 23 (§ 24) to pass back again 
to the original formulas in the Hilbert-type proof-theoretic conclusion. 

(a) Assume X 0 -E F. Then — ► F is not falsifiable in {0, 1, 2, . . .}. So by 
(II) of Theorem 34 (the upper version with k — 0, / = 1), E f/4 — » F. So by 
Theorem 36 Corollary (a), E y/ F. Similarly with k > 0. 

(b) Assume E 0 , E„ E 2 , . . . X 0 -t= F. Then it is impossible in {0, 1, 2, . . .} 
to make all of E 0 , E lt E 2 , . . . t and F f, simultaneously; i.e. 

. . . , E 2 , E t , E 0 — * F is not X 0 -falsifiabIe. So by (II) of Theorem 34 (the lower 
version), for some d, E E d , . . . , E 0 F. Now apply Theorem 36 Corol- 
lary (a). 

(d) The first alternative is equivalent to: (I) . . . , E 2 , E t , E 0 — * is X 0 - 
falsifiable. So by Theorem 34, if that alternative doesn’t apply, then: (II) 
for some d, E E d , . . . , E 0 — whence by Theorem 36 Corollary (d), 
E m(E 0 & ... & E d ). - 

The parts of Theorem 37 are not independent (Exercise 51.2). Moreover, 
the Lowenheim-Skolem theorem (Theorem 35) can be inferred from 
Theorems 12,, d and 37 (with the bracketed versions of the hypotheses), as 
easily as we inferred it from Theorems 33 and 34 (Exercise 51.3). 

The significance of Godel’s completeness theorem and the Lowenheim- 
Skolem theorem will be discussed at the end of § 52 and in § 53. 

Exercises. 51.1. Treat the following cases in the proof of Theorem 
36: — »&, — >~i, — (cf. Exercise 5.3), V — >, 3— ». 

51.2°. Infer all parts of Theorem 37, and the results of using Theorem 36 
in Theorenl 34, from each of: 

(a) //E 0 , E„ E,, . . . X # -E F, then E„, E 1( E g , ... E F. (This part of Theorem 
37 (b) is our most compact form of Godel’s completeness theorem.) 

(b) Theorem 37 (d). 

51.3. Infer Theorem 35 from Theorems 12 1>d and 37 (c), (d). 

§ 52. Godel’s completeness theorem, and the Lowenheim-Skolem 
theorem, in the predicate calculus with equality. In §§ 48-51, = 
might have been one of the predicate symbols, but we did not give it a 
special status in the evaluation process (or in the deductive apparatus), as is 
done in § 29. To have done so would have prevented us from simply making 
all atoms in uA t, and all in UA f, in the proof of Lemma 9. Thus the “E”, 
“ E” etc. in Theorems 34-37 are those of §§ 17, 21, 28, not of § 29. 849 In the 

m The proof of Theorem 33 is good under either meaning of g. We aren’t defining 
another formal system <74 for the case of the predicate calculus with equality, as we did 
with H in § 29. 
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Lovvenhcim-Skolem theorem (Theorem 35), if we start with a domain D 
and satisfying assignment for Ej, . . . , E t or E 0 , E 1( E 2 , . . . in which = does 
have the usual meaning of equality or identity, we cannot be sure that = has 
that meaning in the new satisfying assignment in {0, 1, 2, . . This can 
be remedied for Theorems 35 and 37, while the domain may become finite, 
by an easy device due to Kalmar 1928-9 and Godel 1930. The closed 
equality axioms mentioned in Lemma 12 are described in § 29 following 
Theorem 28. 

Llmma 12. If E ( „ E,, E g , . . . include the dosed equality axioms for 
= and all the proper predicate symbols and function symbols in E 0 , E 1( 
E 2 , . . . , and under the evaluation rules of the predicate calculus without 
equality E„, Ej, E a , . . . are all t for some non-empty domain D and assign- 
ment, then E () , E x , E.,, . . . are all t for a domain D* ( with 0 < D* < 3) 
and an assignment in which — has the meaning of equality or identity. Simi- 
larly with Ej E^ 

Proof. Consider a given domain D and assignment which make all of 

E 0 , e„ e 2 , . . . t. 

From equality axioms which are included in E„, E lt E 2 , . . . , each of the 
following formulas is deducible in the predicate calculus (§ 21 or § 28), as 
we see by adapting the proofs for Theorem 29 §29 (using V-elims. and 
3- and V-introds.): 

(i) Vx(x = x). (ii) VxVy(x=y 3y = x). (iii) V(x=y & y = z 3 x = z) 
where V is closure § 20 (i.e. in (iii), VxVyVz). 

(iv) For each proper predicate symbol P(aj a„) with n > 0 occurring 

in E 0 , E,, E 2 , .... and each /(/=!,...,«), 

V[x=y 3 (P(a lt . . . , a,.!, x, a, +1 , . . . , a„) ~ 

P(a 1( • • ■ , y, *tj+i> • • • > a n ))]. 

(v) For each function symbol f(a,, . . . , a„) with n > 0 occurring in E 0 , 

Ej, Ej and each i (i — 1 n), 

V[x=y 3 f ( a L , . . . , a^„i, x, a, fl , . . . , a M )“f(aj, . . . , a,_i,y, a t - +i , . . . , a n )]. 

Consequently (by Theorem 12,, (1 , etc.), for the given domain D and 
assignment, all the formulas (i)-(v) are t. 

Let .r ~ // be the logical function or predicate or binary relation assigned 
to the predicate parameter = in the given assignment; thus for each x, y 
in D, x~ y holds exactly when the formula x=y is t for .c and y as the 
values of x and y. 

From the form of (i)-(iii) (and their being t), and the rules for evaluating 
V, 3 and &, it follows that the relation x ~ ?/ is reflexive, symmetric and 
transitive; i.e. (a) for each x in D, x ~ x, (b) for each x, y in D, 
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x ex’, y y ~ x, and (c) for each x, y, z in D, x ~ y &ydz-+x~z 
(Exercise 52.1 (a)). Such a relation we called an “equivalence relation" in 
§ 30. By (B) there, ~ separates D into disjoint (i.e. non-overlapping) non- 
empty classes (called “equivalence classes”) such that any elements x and y 
of D belong to the same equivalence class if and only if x ~ y. The equiva- 
lence class x* to which x belongs is the class of all elements u of D such 
that .r ~ u (by (a), x belongs to this). Our new domain D* will be the set 
of all these equivalence classes. Then clearly 0 < D* < D. 

By the above definitions, when x and y are the values of x and y: 

{x = y is t} = x ~ y = 

{x and y belong to the same equivalence class} s x* — y*. 

Hence the truth value of x = y for given values x and y of x and y in D 
is not changed by changing the value x of x to another member u of D in 
the same equivalence class; and likewise with the value y of y. Similarly, 
from the form of (iv) (and its being t), etc., the truth value of P(a,, . . . , a„) 
for given values a u . . . , a n of a x , .... a, , in Z) is not changed by changing 
the value o, of a, to another value c, in the same equivalence class, i.e. 
such that x = y is t when x, y are evaluated by a„ c,(i = I, . . . , n). Like- 
wise from (v), the equivalence class to which the value in D of f(a,, .... a„) 
belongs for given values a„ . . . , a„ of a x , .... a„ is unaltered by changing 
the value a, of a, to another value e, in the same equivalence class 
(i'= I, • • • ,«)• 

So in every step in the evaluation of a formula, we can disregard the 
differences between different members of D which are in the same equiva- 
lence class without altering the truth value obtained; in effect we can 
coalesce the elements of D belonging to each equivalence class into one 
element which behaves like any one of them in the evaluation. If we con- 
strue the element obtained by this coalescing of elements in an equivalence 
class to be the equivalence class itself, we are then doing the evaluation in 
the domain D*, as we proposed. Since this coalescing does not affect the 
outcome of any evaluation, each of E 0 , E„ Eg, ... is t as evaluated in D*. 

The assignment in D* which accomplishes this, and which we have 
described as resulting from the given assignment in D by coalescing the 
elements of equivalence classes, can be described more explicitly as follows. 

To x=y is assigned the value t exactly when the values x* and y* in D* 
of x and y are the same member of D*. This is what we were aiming to 
accomplish. A proposition symbol P is evaluated in D* by the same truth 
value as in D. An individual parameter e, if evaluated in D by e, is evaluated 
in D* by c* (the equivalence class to which e belongs). An //-place predicate 
symbol P with n > 0, if evaluated in D by P, is evaluated in D* by P* 
where P*(a*, . . . , a*) = {P(a x , .... a„ ), for any choices a l3 . . . , a„ of 
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members of a*, . . . , a*, respectively}, that value being independent of the 
choices (as we inferred from (iv) being t in D). We define similarly the value 
f* in D* of an «-place function symbol f with /; > 0 from the value / off 
in D (Exercise 52.1 (b)). — 

Let “...(< N 0 )-|- . . stand for . . B-t ... for each domain D with 
0 < b < X 0 ”, and “(< K 0 )-satisfiable” for “satisfiable in some domain D 
with 0 < D < K u '\ 

Theorems 35 and 37°. Theorems 35 and 37 hold good reading “ the 
predicate calculus with equality ”, ”(< K 0 )-b” and “( < ^^-satisfiable” for 
"the predicate calculus'’, “X n -I=” and “ H 0 -satixfiable ”, respectively. 

Pkook We shall prove Theorem 37.. (d), Theorem 37.. (a)-(c) and 
Theorem 35 can be proved similarly, or inferred thence similarly to 
Exercises 51.2 (b) and 51.3 (Exercise 52.2). 

For Theorem 37 = (d), we treat the case that E 0 , Ej, E 2 , . . . contain X 0 
proper predicate symbols and function symbols, giving rise (with =) to K 0 
closed equality axioms Q 0 , Q„ Q 2 , . . .. 

Theorem 37 (d), applied to the list of formulas E 0 , Q 0 , E x , Q x , E 2 , Q 2 , . . . 
as its E„, Ei, E 2 , . . . , gives us two alternatives. 

Case (1): E„, Q 0 , Ej, Qi, E s , Q 2 , . . . are simultaneously satisfiable in 

the domain D — {0, 1,2,.. .}, under the evaluation rules of the predicate 
calculus. Then by Lemma 12, they are also simultaneously satisfiable in a 
domain Z)*(with0 < D* < D = N 0 ) with = meaning equality, i.e. under 
the evaluation rules of the predicate calculus with equality; a fortiori, 
E„, E t , E 2 . . . . arc so satisfiable. 

Casi (II): for some </, say d— 2c as the case of d odd is similar, 

h ~hE„ & Q„ & . . . & E c & Q ( .) in the predicate calculus. Thence by propo- 
sitional calculus, for that c, Q„, . . . , Q, b “t(E 0 & . . . & E r ) in the predicate 
calculus, liut then by § 29, h ~ i(E 0 & ... & E r ) in the predicate calculus 
with equality. — 

We conclude this section with some remarks on how Godel’s complete- 
ness theo rem 1 930 and the theorem. on consistency with respect to validity 
establish various equivalences between model-theoretic notions and proof- 
theoretic ones. These remarks apply both to the predicate calculus and to 
the predicate calculus with equality. (For the predicate calculus, we could 
have put them at the end of § 51.) 

B) Theorem 37 (a) and (b) with Theorem I2,.,i and the first paragraph 
of § 23 (or the like for the predicate calculus with equality): 

{b F} = {LF}, 

{F„ ...,E t tF}3 {E x E* b F}, 

{E 0 , Ei, E„ . . . b Fj = {E 0 , E„ E,, . . . b F}. 
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In the second and third of these equivalences, since we are using the 
symbols “f” and “E” without superscripts, the conditional interpretation 
applies to any free variables that occur in E„ . . . , E fc or E 0> E x , E 2 , . . . 
(§§20, 21). Before applying these equivalences, closures should be taken 
with respect to any variables intended to have the generality interpretation, 
by the operator V' of § 20 (by V, if all are to have the generality interpre- 
tation). 

The second equivalence is of interest when A lt .... A t are formulas 
expressing the axioms of some formal axiomatic theory in the symbolism 
of the (first-order) predicate calculus (without or with equality), and 
Ei, . . . , E fc are their respective closures. For example, A^ . . . , A k can be 
the 6 nonlogical axioms E1-E3, G1-G3 of the formal system G of group 
theory § 39. (It is natural to use free variables with the conditional inter- 
pretation in temporary assumptions for the sake of an argument, but not 
in the axioms of a mathematical theory. Instead, one would more naturally 
use individual symbols, like the symbol 1 of the system G. Hence we take 
“full” closures here, by V, not just V'.) “E x , . . . , E t t F” expresses the 
model-theoretic notion of what it means for F to hold in such a theory; 

it says that F is true in all mathematical systems which make A t A k 

simultaneously true (under the generality interpretation of their free vari- 
ables). Any such system we now call a “model” of Ai, . . . , A* (or of 
Ej, ... , E*); precisely, a model of A u . . . , A k is a (nonempty) domain D 
and an assignment in D to the parameters of E x , . . . , E* which make 
Ej, . . . , E fc simultaneously t (under the evaluation rules of §§ 17, 28, or of 
§ 29, according as we are working in the predicate calculus without or with 
equality). 250 “E^ . . . , E* E F” expresses the proof-theoretic notion of F 
holding in the same theory (§§ 21, 28, or § 29). 

The third equivalence has the same significance when the formulas A 0 , 
Aj, A 2 , . . . express in the predicate calculus (without or with equality) the 
axioms of a formal axiomatic theory with axioms, and E 0 , E lt E 2 , . . . 
are their respective closures. 242 For example, A 0 , A 1; A 2 , . . . can be the non- 
logical axioms of N §38, namely Axioms 14-21 and the X 0 axioms by 
Axiom Schema 13. (A finite number of axioms will not suffice for N; cf. 
end § 47.) A model of A 0 , A l5 A 2 , . . . (or of E 0 , E 1( E 2 , . . .) is a nonempty 

350 of ten 0 nc speaks loosely of a mathematical system satisfying some axioms, or of 
some axioms being true under an interpretation, when it is to be understood that free 
variables occurring in the axioms have the generality interpretation. The precise meaning 
is then what we have rendered here in the definition of ‘‘model”. 

In technical discussions, when we say that formulas are satisfiable, in general or in a 
given domain, we mean precisely that the formulas themselves are satisfiable (but not 
necessarily their closures, if the formulas are open); and similarly with “satisfying 
assignment”, “falsifiable”, “counterexample”, etc. 
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domain D and an assignment in D to the parameters of E # , E x , E 2 , . . . 
which make F 0 , E t , E 2 , . . . simultaneously t. 251 

Gddel’s completeness theorem also gives the equivalence of Hilbert’s 
proof-thcoreiic \crsion of the consistency problem for axiomatic theories 
based on the predicate calculus without or with equality (“Is the theory 
deductble from the axioms free from contradiction?”) to t he older model- 
theoretic version (“Are the axioms true of some system of objects?”), §~36. 

It was clear in a rough way back in the days of formal axiomatics that 
the existence of a mathematical system satisfying the axioms, or precisely 
what we now call a "model”, implies that no contradiction can arise in 
the theory deducibie from the axioms, if the theory in which the model is 
constructed is free from contradiction. The argument ran: Suppose a con- 
tradiction were deducibie from the axioms; then in the theory providing 
the model, by corresponding inferences about the objects constituting the 
model, a contradiction would be deducibie from the corresponding theo- 
rems. This was necessarily rough then, because “the theory deducibie from 
the axioms” only became exactly defined with the formalization of the 
whole language and logic in modern proof theory. We now have 
the principle that a model establishes consistency; simply by contraposing 
the chain of implications 

{E„ . . . , E, h P & -iPJ > {E„ . . . , E* k P & ->P} - 

{E 1( . . . , E* are not simultaneously satisfiable}, 

and similarly with E 0 , E„ E 2 , ... as the closures of the axioms. 

But the converse of that principle was by no means clear. Consider its 
contrapositivc. Why, when a system of axioms is vacuous (i.e. not true of 
any system. § 36), or as we are now expressing it, when their closures are 
not simultaneously satisfiable, must a contradiction necessarily follow from 
the axioms in a finite number of elementary logical steps? By Godel’s 
completeness theorem, however, we now know that this is so. Take the 

case of X„ axioms A 0 . A,, A 2 , If their closures E„, E„ E 2 , . . . are not 

simultaneously satisfiable, then by Theorem 37 (d) (or Theorem 37 = (d)), 
for some </, — i(E „&...& E,,) is provable in the predicate calculus (or the 
predicate calculus with equality), so that formula and the contradictory 
formula E„ & . . . & E ( , are provable in the formal system based on the 
predicate calculus (or the predicate calculus with equality) with A 0 , A lt 

4,1 In the illustrations by G and N, since the equality axioms for = and their function 
symbols are all provable, by Corollary Theorem 31 § 29 it makes no essential difference 
in the proof-theoretic notions whether the logic is the predicate calculus without or with 
equality (Axioms L1-E3, or Axioms IG, 17, are redundant in the latter case). But (for 
systems with = as a symbol) the model-theoretic notions of the predicate calculus with 
equality are usually the ones that primarily interest us. 
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A 2 , . • . as the nonlogical axioms. Another way, stated for the predicate 
calculus without equality (but adaptible to the predicate calculus with 
equality, by letting E 0 , E lf E 2 , . . . include the closed equality axioms and 
using Lemma 12): 

{E 0 , E„ E 2 , . . . are not simultaneously satisfiable) 

E 2 , Ej, E 0 — > is not falsifiable} 

{for some d , V E u , . . . , E„ — »} [Theorem 34] 

— >■ {for some cl, E u , . . . , E 0 V P & ~ iP} [Theorem 36]. 

ExiiRC'iSliS. 52.1. Supply details in the proof of Lemma 12 as follows: 

(a) Prove the transitive property of ~. 

(b) Write out the definition of /*. 

52.2. Prove Theorems 37 = (b) and 35 = (b). 

52.3. Prove (via model theory) the proposition of Footnote 107 § 29. 

52.4*. Show that, in the predicate calculus without equality, Axioms 

14-18 of N §38 possess no finite model (though they have a countably 
infinite one). 

§ 53. Skolem’s paradox and nonstandard models of arithmetic. 
Go del’s completen ess theorem of 1 930 is not as famous as his incomplete- 
ness theorem of 1931. But a little consideration will show that it is an 
equally remarkable theorem, when we include with it the older Lowenheim- 
Skolem theorem, which comes out of it immediately (Exercise 51.3), and 
of which Skolem’s 1922-3 proof comes close to the 1930 theorem. 853 

258 According to modern scholarship (van Heijenoort 1967), there are serious gaps in 
Lowenheim’s attempted proof 1915 of the theorem which bears his name. What 
Lowenheim stated is our Theorem 35„ (a) (except without all our possibilities for the 
definition of “formula”). 

The first correct proof of this proposition is in Skolem 1920, where it is generalized 
to a 0 formulas (giving essentially our Theorem 35_ (b) without the “compactness” 
part). In this proof, Skolem used the set-theoretic axiom of choice (§ 35), and his normal 
form 1920 for formulas of the pure predicate calculus (cf. Hilbert and Bernays 1934 pp. 
158 ff., 1M p. 435). 

The work of establishing the reducibility to Skolem normal form can be spared, 
however. By using instead any prenex form (Theorem 27 § 25), we get a very simple 
proof in the following manner (Skolem 1929 p. 24, Kleene 1958 p. 139). Say e.g. a 
prenex form of E is Vw3xVy3zA(w, x, y, z) where A(w, x, y, z) is a quantifier-free for- 
mula containing just the distinct variables v, w, x, y, z and two function symbols *, X(— ). 
Using the axiom of choice, then E is satisfied in a non-empty domain D by a certain 
assignment, if and only if VwVyA(w, p(w), y, 8(w, y)) (where p( — ), 8(— , — ) are new 
function symbols) is satisfied in D by that assignment plus some assignment to (3, 8. 
But a domain and satisfying assignment for the latter formula remain such, when the 
domain is cut down from D to the countable domain D* consisting of just the values of 
v and k and the other members of D obtainable thence by repeated applications of the 
functions evaluating X, p and 8, and the functions and predicates in the assignment are 
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The equivalences between model-theoretic notions and proof-theoretic 
notions which we have just recounted (end § 52) are remarkable in the 
following respect. The model-theoretic notions (validity, satisfiability, etc.) 
are highly transcendental : thus, for a domain of X 0 elements, the set of 
the logical functions (which is involved in those notions) has the uncount- 
able cardinal 2 N » (Exercise 34.4 (b)). The proof-theoretic notions, in con- 
trast, are quite concrete; they are (initar v. Still, these are results which were 
sought for the predicate calculus, whether or not they were fully expected. 
They confirm that the predicate calculus (without or with equality) fully 
accomplishes (for first-order theories) what has been conceived to be the 
role of logic. 

But Godel’s completeness theorem (or the Lowenheim-Skolem theorem) 


“restricted” to D*. ' 

These axiom -of-c hoice p roofs establish that, given any model of a formula or list of 
formulas, there is a finite or countably infinite submodel of the given model (in the sense 
just illustrated). No special attention is required to = , whose meaning of identity, if 
used in the given model, carries through to the new model. 

The device (used in §48 50) of picking truth values of atoms toward building up a 
satisfying (or falsifying) assignment, independently of the values the atoms have in a 
given such assignment, avoids the use of the axiom of choice, but sacrifices the submodel 
result, and (at least as ordinarily carried out) required a restoration of the usual meaning 
of = (as in § 52) in the case of the predicate calculus with equality. In 1922-3 pp. 
220-222 and 1929 pp. 24-29, Skolem used this device to give proofs of the Ldwenhcim- 
Skoiem theorem (in versions like our Theorem 35 (a) and (b)) which at least implicitly 
(but in retrospect quite obviously) establish everything in Godel’s 1930 completeness 
theorem (including compactness), with two exceptions. First, only a form of the basic 
discovery (beginning § 50) is established; it is not shown further that, when validity is 
confirmed by the finite mechanical process in Skolem’s treatment, one has provability 
in Hilbert’s sense. Skolem could hardly have done this in 1922-3, since Hilbert’s proof- 
theoretical formulation of the completeness problem only became clearly defined in 
Hilbert and Ackermann 1928 p. 68. So we can say that Skolem in 1922-3 discovered the 
completeness of intuitive logic (of first order), and independently Godel in 1930 (not 
knowing Skolem 1922 -3 or 1929) discovered the completeness of formal logic. Secondly, 
Skolem does not give the supplementary argument to restore the meaning of =. (There 
is no explicit discussion in his 1920, 1922-3 or 1929 of the impact of equality on the 
theorems in question, and it is not always clear what he intends regarding =.) 

According to Droben and van Heijenoort (in the introductory note to the translation 
of Skolem 1928 in van Ileijenoort 1967), one can get Godel’s 1930 theorem (without 
equality) by combining arguments in Skolem 1920 and 1922-3, or 1929, with ones in 
llerbi'and 1930 (plus now-famihar results on prenex form, say our Theorems 27 with 
1 2, and 19). 

The treatments of Gddel’s completeness theorem cited above do not deal with func- 
tions directly, as is done in Cases <C) and (L>) § 50 (exceptions: the argument obtained 
by combining Skolem and Herbrand, the related treatment in Kleene 1958 and 1961 
outlined in Footnote 280 below). Of course, since Hilbert and Bernays 1934, the results 
could be extended to functions via a replacement of functions by their representing 
predicates tcf. §§ 38, 45; IM p. 424). 
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has given us more than was sought. What was not sought, and presumably i 
was unexpected when discovered (in 1915, 1920 and 1930), is singled out 
in Theorems 35 and 35 = . These extra dividends make the theorem as much 
an incompleteness theorem for systems of axioms as it is a completeness __ 
theorem for logic. j 

Since Godel’s completeness theorem and the L6wenheim-Skolem \f 
theorem involve the non-constructive notions of validity and satisfiability , 
they do not belong to metamathematics. So their proofs cannot be w holly 
Unitary. Proofs like those above hold this non-constructiveness to a mini- 
mum. In fact, the only non-constructive steps in the proofs of Theorem 34 
and of Theorem 37 with “N„-t=” arc applications of the classical law of the 
excluded middle to propositions about countably infinite collections. 253 

It may not at first seem remarkable that, if formulas Ej E k or 

. . are simultaneously satisfiable in some domain D, they are 
also simultaneously satisfiable in {0, 1,2,.. .} or a finite domain, as the 

Lowenheim-Skolem theorem asserts. But suppose E t E fc or E 0 , 

E 2 , . . . are the clos ures of the axioms of a system of axiomatic set theory . 
Take e.g. the axioms of Godel 1940. In them Godel used two sorts of 
variables, one for “classes” (end § 35) and one for “sets”. But all sets are 
classes, and no other objects are considered. So the axioms can be restated 
using only class variables, and three predicate symbols: x=y, xey and 
'l)l(x) (“x is a set”). 254 Mathematicians gen erally suppose that the. axioms. 
Aj, . . . , A 17 are all true of a nonempty system of objects (the class es 
including' the setsTToT Godel’s set theory). That is, they sup pose l h cui ix it_ 
non-empty domain D and an assi gn ment in D to the piedicaLc-symhols 

e'and Dt (with = meanin g equalitvl for which the closures F. t E 17 of 

the axioms are all t. (The D and assignment are a model of the axioms.) 
But then by the Lowenheim-Skolem theorem (Theorem 35 = (a)), there is 
a domain D* with 0 < D* < N 0 in which E t E 17 are also simulta- 

neously satisfiable, Inspection of the axioms rules out the possibility that 
D* < N 0 . Thus, if there is any model of the axioms at all, there is a count- 
ably infinite model (with = meaning equality). There are only K 0 “sets” 
in the new model. Yet in the axiomatic set theory based on the axioms, 
Cantor's theorem holds ((C) § 34). By this theorem, the set of the subsets 

213 Utilizing this obscrvationj ^Hilb ert and Bernays 1939 pp. 234-253 formalize their 
proof of Cibdel’s completeness theorem in (essentially) the number-theoretic system 
N, and thus establish the following.jn.etarnathematicaI completeness theorem for the 
pure predicate calculus (cf. IM p. 395). 

The addition to the predicate calculus of an improvable predicate letter formula for use 
as an axiom schema would cause the number-theoretic formal system based on the predi- 
cate calculus and the nonlopical axioms of N (§ 38) to become os-inconsistent (§ 47). 

231 Cf. IM p. 420 Example 13. Axiom A1 and the predicate symbol (£ls (x) (“x is a 
class”) can then be omitted. 
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of the natural numbers (which is a set in the theory) is uncountable. This 
is Skolcm's ‘paradox” 1922-3. Skolem used his own axioms for set theory 
1922-3, with K 0 axioms. The argument is the same with X 0 axioms, using 
Theorem 35 . (b), or Theorem 35 (b) for set theories (having only sets as 
objects) in which = is defined from e as in § 26. 

1 Skolcm's is not a paradox in the sense of an outright contradiction, but 
** rather a kind of anomaly. For, there is an explanation: the “enumerating 
set” of ordered pairs, by which D* is put into 1-1 correspondence with the 
natural numbers, i s not itself a set adm it ted in the axiomatie._set theory . 

The “paradox” entails that any axiomatization of set theory by count- 
ably many axioms in the restricted predicate calculus with equality must 
fail fully to capture the notions of “set”, “set of subsets of a given set”, 
“1-1 correspondence”, “countable”, etc. These concepts, if we give them 
a prior status, elude characterization by any such set of axioms. But the 
paradoxes of set theory (§ 35) make it hard to give them a prior status, 
independent of any system of axioms. This led Skolem to the view that the 
concepts of set theory have only a relative status (“relativity of set theory”). 
Thus a set which is unc ountable in on e axio matization c an be counta ble 
1 in another, and there i s no absolute notion of countabilit y, 
v Of course, another conceivable explanation of Skolem’s “paradox” (for 
a given system of axiomatic set theory) is that ther.e is no model. In this 
case, by Godel’s completeness theorem (as at the end of § 52), 

-i(E! & ... & E a .), or -i(E 0 & . . . & E d ) for some d, would be provable in 
the predicate calculus. Thus there would be a “real” paradox, i.e. a con- 
tradiction, in the axiomatic set theory, though (in any of the systems of 
axiomatic set theory commonly employed) it has n o t^y et_b e e n disco ve red. 

We conclude this section with two applications of “compactness”, viz. 
that if each finite subset of E 0 , E x , E 2 , . . . are simultaneously satisfiable, 
so are all N 0 of the formulas (in Theorems 35 and 35^,). 

The first application (Theorem 38) deals with the question whether the 
natural number sequence 0, 1,2,... can be completely described by a list 
or axioms A 0 , A t , A 2 , . . . written in the symbolism of the (first-order) pred- 
icate calculus with equality. 

In particular, are the X 0 non-logical axioms A 0 , A'j, A 2 , ... of the num- 
ber-theoretic system N § 38 true only under the intended interpretation 
described there? The mathematical system = ( D 0 , 0 0 , ' 0 , + 0 > 'o)> con ' 
sisting of the natural numbers D 0 — {0, 1,2,.. .} as the domain, and the 
usual zero, successor, plus and times as the values of 0, ', + and •, consti- 
tutes a model for them (with = meaning equality); i.e. this domain and 
assignment makes their closures E 0 , E x , E 2 , . . . simultaneously t in the 
predicate calculus with equality. The question is whether they have any 
other model than S„. 

In a trivial way, they obviously do. For, the axioms do not say what the 
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individual members of the domain D shall themselves be. The axioms leave 
us free to choose those members as we please, provided we can then choose 
from among them an individual to be called 0, and functions to be called 

+ and ■, having the formulated properties. So we can satisfy E 0 , Ej, 
Eo, . . . by taking as D any countably infinite set, and using for the values 
of 0, + and • the member of D, and the 1-, 2- and 2-place functions 

over D, which are determined from the usual ones via some enumeration 
of D. Thus the three following systems S 0 (the usual natural numbers), 

(the non-positive integers with unusual definitions of ' and •) and S 3 
(the positive integers with unusual definitions of 0, + and •) are all models 
of A 0 , A u A 2 , . . . : 

S„ = (A,, 0„, ' 0 , + 0 , •«) = ({0,1, 2, . . 0, x+ 1, x+y, xy), 

5, = (£»!, 0 1; \, +!, -i) = ({0, -1, -2, . . .}, 0, x-l, x+y, -xy), 

5 2 = ( D 2 , 0 2 , ' 2 , + 2 , - 2 ) = ({1, 2, 3, . . 1, x+l, x+y- 1, (x- 1)0/- 1)+ 1). 

If we have a prior notion of . . . , — 2, — 1 , 0, 1 , 2, . . . under which the sets 
{0, 1,2,.. .}, {0, —1, —2, . . .}, {1, 2, 3, . . .} are different, the systems S 0 , 
S ly S 2 are different. But they are “inessentially” different, in that they have 
a common structure or form; they are “isomorphic”. 

To define this idea carefully, we say a given 1-1 correspondence of D* 
to D is an isomorphism of S* = ( D *, 0*, +*, •*) to 5 = (D, 0, ', +, •)> 

if it “preserves” the notions of the system, i.e. (writing x*ox to say that, 
in that 1-1 correspondence, x * as member of D* corresponds to x as 
member of D), if 0*<=>0, x*ox -> x*'*ox' and 

i*oa; & y*oy — > x*+*y* <=> x+y & x*-*y*<s>x-y. We say S* is isomor- 
phic to S, if there is an isomorphism of S * to S. This defines “isomor- 
phism” and “isomorphic” for systems of the type of (Z>, 0, ', +, •) where 
D is a nonempty set and 0, +, • are 0-, 1-, 2- and 2-place functions 

over D with values in D. The definitions are similar for other types of 
systems; e.g. for (D, <) where D is a nonempty set and < a 2-place 
predicate over D, an isomorphism of (D*, <*) to ( D , <) requires' that 
x*ox & y*<t>y -* ( x*<*y * = x<y). 

If we consider systems of the type of (£>, 0, ', +, •) without regarding 
the members of D as known other that through the relationships of the 
system, then the various isomorp hic s ystems just des cribed beco me me rged 
into a single “abstract syste m”, of which they are different “represent a- | 
tions” ; cf. I !VT § 8. (Such an abstract system can be regarded a s an equiva - 
len ce class of m ore concrete sy stems under the equivalence relation ' 
“isomorph ic toll). 

The following system S 3 (the usual integers) is not isomorphic to S 0 \ 

5 3 = (£>3,0 : „ ' 3 , + 3 , - 3 ) = ({-.., —2, -1,0, 1, 2, . . .}, 0, x+l, x+y, xy). 
For, suppose we attempt to establish an isomorphism of S 3 to S 0 . Say e.g. 
that, in the proposed 1-1 correspondence of {. . . , —2, —1, 0, 1,2,.. .} to 
{0, 1, 2, . . .}, 2o0. Then the requirement for an isomorphism that 
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x*< >x- - forces us to put 3<^>l,4<r>2, 5c>3, . . . , and we have 

no elements of S n left over that can correspond to the remaining elements 
.... —2, —1,0, 1, of S 3 . The system S 3 is not a model for the axioms of 
N, since Axiom 15 ~i u/=0 does not hold for it (under the generality inter- 
pretation of a); its closure is f for S 3 as the domain and assignment, since 
when o has the value —1, «'=0 is t and ~ m'=0 is f. 

Now we rephrase our question about N to ask the following. Do the 
nun-logical axioms of N have only models isomorphic to the intended 
model (£>„, 0,„ +„,-„)? In the terminology of §36, is this system of 

axioms categorical ? Are these axioms true of no other abstract mathemat- 
ical system than the natural numbers (considered abstractly)? 

More generally, is there some countable set of formulas A 0 , A„ A 2 , . . . 
in the symbolism of the predicate calculus with equality (including at least 
the symbols 0 and ') which constitutes a categorical set of axioms for the 
natural numbers? 

These questions are answered in the negative by the following theorem. 

Thloki.m 38. (Skolem’s theorem 1934 on nonstandard models of 
arithmetic.) Let A„, A), A™, . . . he formulas in the predicate calculus with 
the symbols 0, = and possibly other ( individual ,) function and predicate 

symbols. Suppose A,„ A t , A s , . . . are all true ( under the generality interpre- 
tation of their free variables) of the system S of the natural numbers 
D — {0, 1,2,...} with 0, ', = in their usual meanings (and any appropriate 
ipeanings of the other ( individual ,) function and predicate symbols)', i.e. 
S = ( Z), 0, ', . . .) is a model of A 0 , A,, A 2 , . . . in the predicate calculus with 
equality. Then there is a model S* = ( D *, 0*. . . .) of A 0 , A b A 2 , . . . in 

the predicate calculus with equality such that D* = N 0 and (£>*, 0*, '*) 
is not isomorphic to ( D, 0, ') ( a fortiori, S* is not isomorphic to S). 

Proof. Wc can suppose that A 0 , A b A 2 , . . . include Axioms 14 and 15 
of N, as otherwise we could add them (since they are both true of S). Let 
E 0 , Ei, Eo, ... be the closures of A 0 , A b A a , . . .. Let i be some individual 
symbol not in A 0 . A b A a , . . .. Consider the list of formulas 

Eqj Oj^i, Ei, 1 E 2 , 2t ^\, . . . 

where (as in § 38) 0, 1,2,... are the terms 0, O', 0", . . . (called “numerals” 
in §43) and r?£s is — ir=s. For each d, the first d+ 1 of the displayed 
formulas are simultaneously satisfiable in the domain D of the natural 
numbers; indeed, w'e can use the given model S of A 0 , A b A 2 , . . . aug- 
mented by J as value of i, since (where d\s 0' . . . ' with d accents, as 
in §43) is not among the first d + 1 formulas. So by compactness (in 
Theorem 35 (b), using the bracketed version of the hypothesis), all the 
X 0 displayed formulas are simultaneously satisfiable in some domain D* 
with 0 < D* < X 0 ; say they are satisfied in D* by 0*, as values of 
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0, . .. However, since Axioms 14 and 1 5 are among A 0 , A lt A,, . . . , we 
easily exclude D* < X 0 (Exercise 53.1); thus 5* = R 0 - We must show 
that ( D *, 0*, '*) is not isomorphic to ({0, 1,2,...}, 0, '). Suppose there 
were an isomorphism of (D*, 0*, '*) to ({0, 1,2,...}, 0, ')• Say that in it (as 
a I I correspondence of D* to {0, 1,2,.. .})a*<=>0, a* <=>1, a* o2, . . 
so D* = {«*, r/*, a?, . . .}. Now what are the values in 5* of the numerals 
0, 1, 2, . . . ? Since their values in 5 (computed from the usual zero and 
successor as the values of the symbols 0 and ') are the natural numbers 
0, 1,2,..., their values in S* (computed from 0*, '* as the values of 0, ') 
must be a*, af, a%, . . . , by the properties of the isomorphism that0*<x>0, 
x* ox—> x*'*ox'. Now what is the value a* in S* of i? Because O^i, 

1 ^i, 2#i, . . . are all t in S*, a$ 7 ^ a*, af 7^ a*, a* a*, This is 

absurd, since a* e D* and D* = {a$, a *, . . .}. 

A system like S* which satisfies the axioms of number theory but is not 
isomorpfiicJo~the TTsir aT or ^‘statidlmP number theory is called a nun- 
itaiulaai model of number_ theory (or of arithmetic ), or a Skolem model. 
the theorem that any (finite or) countably infinite set of number-theoretic 
axioms in the first-order predicate calculus with equality has a non- 
standard model was originally proved by Skolem in 1933 (finite case) and 
1934, using a direct construction of the nonstandard model. SkolenTs 
construction was employed by Ryll-Nardzewski 1952 in proving the 
non-(finite-axiomatizability) of number theory § 47. 

The very compact proof above based on compactness is due to Henkin 
1947 p. 70, 1950 p. 90. 

It is surprising that the existence of nonstandard models of the usual 
ixioms of elementary number theory was not widely recognized very early 
ay juxtaposing Godel’s completeness theorem 1930 and his incompleteness 
heorein 1931, thus: 255 
[formal number theory} = 

{predicate calculus} 4- {number-theoretic axiom system}. 
By Godel 1930, {predicate calculus} is complete. 

By Godel 1931, {formal number theory} is incomplete. 

Therefore, {number-theoretic axiom system} is incomplete. 

Not however with quite the version in Godel 1931, where the incompleteness is 
'i veil for “Principia Mathematicu [Whitehead and Russell 1910-1 3) and related systems”. 
’M is not of the form ((first-order) predicate calculus) + fnumbcr-thcorclic axiom 
y stein }. Numher-theoretic systems of that form (like N in §38) came in with Hilbert 
928, so they are sometimes called Hilbert arithmetic. In 1 93 1-2 (reporting a colloquium 
teld January 22, 1931), Godel states his incompleteness theorem for such a formal 
ystctn, with all the usual axioms of elementary number theory. 

When the author pointed out the connection between Skolem's and Godel's theorems 
n IM (1952b) p. 430 and 1956 § 18, he knew of no earlier mention of it in the literature, 
(eccntly, he has come across 2 lines in Godel’s review of Skolem 1933 (1934a p. 194 
ines 10-11) and 3 in Henkin 1950 (p. 91 lines 8-10) alluding to such a connection. 




328 I HE PREDICATE CALCULUS (ADDITIONAL TOPICS) CH. VI 

To give this argument more explicitly for the N of § 38, consider the 
formula nC ( , of Theorem V § 43 (Godel’s incompleteness theorem), which 
is true under the standard interpretation of the number-theoretic symbol- 
ism, but improvable in N. In Godcl’s completeness theorem (say with 
cqualilv. Theorem 37. (b)) "If E„, II,, E 2 , . . . (<N 0 )-t= F, then E 0 , E,, 
E 2 , . . . h F”, take E„, E,, E 2 , . . to be the closed number-theoretic axioms, 
and F to be — iC'„. Now L„, I:,. I; 4 , . . . must be true under some inter- 
pretation (not the standard one) with a countable domain in which ~ iC„ 
is not true, or Godel’s completeness theorem would imply that E 0 , E„ E 2 , 
. . . I — iC„ in the predicate calculus with equality, 251 and hence that 
h — iC„ in N, contradicting Theorem V. 256 

From this standpoint, Godel’s incompleteness theorem is a phenomenon 
of the same kind as the unprovability of either Euclid’s fifth postulate or 
its negation from the other postulates of Euclid (§ 36). Euclid’s fifth 
postulate (or — iC„) is true under one interpretation of the axioms in 
question, false under another. 

In any countable model (£>*, 0*, + *, •*) of N, D* — X 0 , since the 

axioms of N include Axioms 14 and 15. In any nonstandard model 
(T>*,()*, +*,•*) of N, the part (D*, 0*, '*) is nonstandard, i.e. not 

isomorphic to ({0, 1 . 2, . . .}, 0, '), since the axioms of N include the 
recursive definitions of + and ■ (Axs. 18-21), For, to make the closures 
of these four axioms all t when 0 and ' have their usual meanings with 
D = {0, I, 2, . . .) as the domain, we are forced to give + and ■ their 
usual meanings; and hence, if (D*,0*,'*) were isomorphic to (£>, 0, '), 
then ( D *, 0*. + *, •*) would be isomorphic to (Z), 0. +, •)• So we do 

get Skolem’s theorem for N. 

The proof of Skolem’s theorem by juxtaposing Godel’s two theorems 
applies to lists A„, A,, A.,, ... of formulas that can be the nonlogical 
axioms of a formal system of number theory. This entails an effectiveness 
requirement, used in proving Godel’s incompleteness theorem (cf. the 
discussion in § 43). 257 This effectiveness requirement does not detract from 

2,6 Still another pair of circumstances shows very simply that there must be non- 
standard models. By the hierarchy theorem of Kleene 1943, the predicates definable in 
the symbolism of N fall into a hierarchy, with predicates of successively higher levels 
requiring more and more quantifiers in their definitions (§46 including Exercises 
46.8, 46.9 (a)). This of course is proved by reasoning with the standard model of number 
theory. But by IM Theorem 40 pp. 401-402, 22 " all the satisfying predicates in Godel’s 
completeness theorem arc definable using just two quantifiers. These results would be 
incompatible, if the model for the latter were (he standard one. This proof is given in 
detail in IM pp. 428 429. 

- To any effective list A,„ A ,, .4,, ... not sufficient to give us the formulas C„ of 
§ 43 using in ihem only the parameters 0, ', + , •, or not including Axs. 14, 15, 18-21, 
we can first add the axioms of N § 38. 

1 here is a way of extending the argument to some nonclTective lists. Cf. IM p. 431 
Remark. 1 . 
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the significance of Skolem's theorem as showing that no list of axioms we 
could actually use can describe the number sequence completely. However, 
Skolem’s proof and the Henkin proof given above apply to any list A 0 , 
A„ A 2 , . . . , whether given effectively or not. 

Peano’s axioms for the natural numbers are commonly regarded as 
being categorical. 1 ’- We formalized these axioms in setting up the system 
N § 38. This apparent contradiction between Skolem’s theorem and the 
categoricity of Peano’s axioms is explained away as follows. The fifth 
Peano axiom asserts that the principle of mathematical induction holds for 
all properties (i.e. I -place predicates) of natural numbers. There are 2 X ° 
such properties. The fifth Peano axiom is only incompletely formalized 
in N, since Axiom Schema 13 gives induction only for the N 0 properties 
expressible by formulas A(x) of N. 

Translating from Skolem’s German, . . the number series is completely 
characterized, for example, by the Peano axioms, if one regards the notion 
‘set’ or ‘propositional function’ as something given in advance with an 
absolute meaning independent of all principles of generation or axioms. 
But if one would make the axiomatics true to principle (konsequent), so 
that also the reasoning with the sets or propositional functions is axioma- 
tized, then, as we have seen, the unique or complete characterization of the 
number series is impossible.” 238 

Nonstandard models have recently become a standard branch of logical 
research; cf. Henkin 1950, Rabin 1958a, Kemeny 1958, Scott 1961, A. 
Robinson 1961, 1963. 

For example, the type of order in the nonstandard models of arithmetic 
is known. Here we suppose the symbolism to include the predicate symbol 
<, and the formulas A 0 , A u A 2 , ... to include at least the nonlogical 
axioms of N §38 and the formula ^<4^3 rf+«=/ ! (< now being 
primitive). Then from their closures E,„ E„ E 2 , . . . , the formula a<b is 
deducible in the predicate calculus for all pairs of numerals a, b for natural 
numbers a, b such that a < b, and ~ia<b for all pairs such that a > b 
(I M pp. 196-197). What orderings can the nonstandard models S* have 
under the order relation a < b = {a<b is t in 5* when a, b are the values 
in D* of a, b} ? According to Kemeny 1958 (who says the result was found 
in 1947 by Henkin and him, but was known much earlier to Skolem), only 
one ordering is possible. Under this, D* consists of the usual natural 
numbers, followed (in increasing order) by families of elements, the ele- 
ments within each family having the same order as the integers, and the 
families having the same order as the rational numbers. 

To describe the order type of the nonstandard models, of course we 

1934 p. 160. Skolem like Peano used the positive integers instead of the natural 
numbers. 
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arc thinking in terms of the standard natural numbers, whence the stand- 
ard integers and rational numbers are constructed. 

Theorem 38 denies the possibility of a formal axiomatic characterization 
of the natural number sequence in the first-order predicate calculus. 

We argued in § 36 that formally axiomatized mathematics is not all of 
mathematics. An intuitive understanding of the natural number sequence 
is already presupposed in the statement of Theorem 38. Thus, in “A 0 , A t , 
A 2 , . . the reader is ejected to understand what the three dots 
mean. Also we wrote “ D * = R 0 ”. Also, we cannot contemplate, as we 
ordinarily do, languages such as the axioms are to be stated in, without 
using concepts basically equivalent to that of the natural numbers. Ab- 
stractly, the sequence of the natural numbers has the same structure as the 
sequence of expressions j, J|, |||, ...» which we used to represent them on 
a Turing machine tape §41. The possible expressions in a language such 
as the predicate calculus (unless we put a fixed upper bound on their 
lengths, so only finitely many of them exist) form a similar system, only 
with an alphabet of more than one symbol. 

Mal'cev 1936 and llenkin 1949 constructed nonstandard models of 
analysis (theory of real numbers), and A. Robinson 1961a, 1963, 1966 
applied them to develop a new approach to (classical) analysis. Bernstein 
and Robinson 1966 thus solve a problem in Hilbert spaces. 

Theorem 39. (Henkin 1949.) If a list of formulas A„, A t , A 2 , . . . in 
the predicate calculus with equality admits arbitrarily large finite models, it 
also admits a countably infinite model. 

Proof. Let E 0 , Ej, E 2 , . . . be the closures of A 0 , A u A 2 , . . .. Let 
Q 0 , Q x , Q,, ... be the following formulas, which are respectively satisfied 
if and only if there are at least 2, 3, 4, . . . elements in the domain: 

3x3y(x?*y), 3x3y3z(x^y &x^z &y?£z), 

3w3x3y3z(w,^x & w^y & w?£z & x^y & x^z & y¥>z) 

Using the hypothesis, for each d, the first d + 1 of 

Lo, Qo> Ei, Qi, E 2 , Q„ . . . 

are simultaneously satisfiable in a respective domain D d ; we just use one 
of the given models having > c+2 elements, if d—2co\d= 2c+l. 
Hence by compactness (in Theorem 35„), all of the formulas last displayed 
are simultaneously satisfiable in a domain D with 0 < D < X 0 . However, 
it is absurd that D < X 0 , as then g, for i> D - 1 could not be satisfied. 

Exercises. 53.1. Prove in detail for Theorem 38 that D* = X 0 . 
(Hint : use Exercise 38.5.) 

53.2. Show that the theorem in Footnote 253 cannot be improved by 
replacing "o>- in consistent" by "(simply) inconsistent" . 
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§ 54. Gentzen’s theorem. In the course of proving Godel’s com- 
pleteness theorem (§§ 48-51), we have been led to a new type of formaliza- 
tion of logic, represented by the Gentzen-type system G4. Indeed, combining 
Theorems 12 1>d , 34 and 36 Corollary (a), we have the equivalence of three 
notions, for any formula F not containing any variable both free and 
bound : 25U 


{h„F} = {!= F} s { h W4 —* F}. 

The “ c ” indicates that classical (nonfinitary) model-theoretic reasoning 
is used. 

The basic result that predicate logic can be formalized in a system like 
(74, or specifically the equivalence { h 7/ F} = { h 04 — ► F} (or later, 

{ Yjj F} = { Y U4a -* F}), we shall call Gentzen’s theorem. From this 
equivalence, the principal results of Gentzen 1934-5 for classical logic, 
and some versions of Herbrand’s theorem 1930 (§55), can easily be 
extracted. 

Gentzen 1934-5 established the basic result entirely within finitary meta- 
mathematics, thus: 


{ br F} == { b G i -* F} 3 { h 01 without cut -* F}. 


The system called “LK" by Gentzen and (with minor differences in the 
notion of formula) “G 1” in IM is a sequent calculus generally similar to 
(74, except that it has a rule called “cut”, as follows: 


A— » A, C C, r -»0 
A, r->A, 0 


Cut 


where C is any formula, and A, A, T, 0 are any lists of formulas. The 
presence of this rule makes it easy to prove { Y u F} — *- { h wi — » F} ; the 
converse is essentially the same as our Theorem 36. In his “Hauptsatz” or 
“normal form theorem”, Gentzen showed that the uses of this rule can 
be eliminated from any given proof in (71 to obtain a proof of the same 
sequent in “(71 without cut”. 

Herbrand and Gentzen used their theorems in giving consistency proofs 
from Hilbert’s standpoint (like those cited in §45 Footnote 194 and §47 
Footnote 216) and in other metamathematical applications. Such applica- 
tions would lose their point if the theorems could not be proved meta- 
ma thematically. Thus, for metamathematical purposes, our treatment at 

a,s To deal with formulas containing variables both free and bound, we can use 
Theorem 25 as in the proof of Theorem 37. 
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this point leaves a gap to be filled, just as for Theorems I-VIIl some gaps 
were left. 260 

Gentzen also gave a version “LJ” of his system for the intuitionistic 
logic, for which we gave a Hilbert-type system at the end of § 25. 261 Here 
our model-theoretic route (via “F F”) is not available, at least not without 
considerable etfort to set up an intuitionistic model theory first (i.e. to give 
an intuitionistic analog ol the classical notion ‘T F’ 

We formulated the 14 rules of our Gentzen-type system G’4 so as to 
give ourselves as few choices as possible at each step upward in searching 
systematically for a counterexample to F or to E x , . . . , E fc — » F x , . . . , F,. 
For a different reason, we required the C for the axiom schema ( x ) to be 
prime. 

When there is no counterexample, so F — > F or 
F E„ . . . , E k —> F\, . . . , F, in G4, the use of other Gentzen-type systems 
G'4a and G4b (described next) may afford simpler proofs. Likewise, in 

A.S llie chart shows, the gap can be filled by reading the proofs of IM Theorems 
46 and 48 tj§ 77, 78, and using for our applications the Gentzen-type systems G4a and 
6’4b introduced below. For agreement with IM and Kleene 1952, wc can optionally 
take "A ~ IT’ in our systems to be an abbreviation lor (A3B)&(BD A). 


(2)0° 



(I) Theorem 12,, a . (2) Theorem 34 (Gddel’s completeness theorem with a Gentzen-type 
formal system). (3) Theorem 33. (4) Theorem 36 Corollary (a) or IM Theorem 47. 
(5) IM Theorem 46. (6) IM Theorem 48 (Gentzen’s Hauptsatz). (7) Cf. Exercise 49.1 (f) 
and (g), or Kleene 1952 Lemma 9. The systems grouped together are very similar, and 
easily proved equivalent. The unnumbered implications are trivial (one system is a 
subsystem of the next). The boldface arrows show the main part of the work on each 
route. 

261 In this chapter, the absence of a “°" on a theorem concerning provability in 
Gentzen-type systems means that there is a known intuitionistic version of the theorem 
(sometimes with qualifications), using the intuitionistic version of one of the systems 
(71, G2, G3, G3a, G. When we do not indicate otherwise, this intuitionistic version can be 
found in IM or Kleene 1952. 

262 For references to Beth’s intuitionistic model theory or semantics, see Kleene and 
Vesley 1965 bottom p. 81. 
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attempting to construct proofs downward, and in performing manipula- 
tions on given proofs, the greater flexibility provided in G4a or G4b is often 


advantageous. 

To get G4a from G4, we add four new rules of inference called “thinning 
(in antecedent or succedent)” and “contraction (in antecedent or suc- 
cedent)”: 


r-»© 
r -*• 0, c. 
r->0, c.c 
r*-> 0, c. 


*T 


~>c 


0 


c, r — + 0. 
c, c,r->0 
c, r->©. 


T—y 


Here C is any formula, and T and 0 any lists of formulas. As with the 
other rules, the order of formulas within antecedents and within succedents 
is immaterial. These rules (or inferences by them) we call structural-, the 
former rules logical, or more specifically propositional) (—>3, , . . , ~-+) 
and predicate (— >V, .... 3—»). To save space that would be required in 
using the thinning rule separately, we shall usually write as one inference 
a series of applications of thinning followed by a logical inference. This 
has the effect of making the 14 logical rules applicable even with formulas 
in the premise(s) absent. 

Example 7. The following proof in G4 can be simplified to a proof of 
the same sequent in G4a by omitting the 13 boldface formulas. This gives 
the maximum simplification which G4a allows. 


P(a), R, Vx(R & P(x», Q(b) -» 3xP(x), P(a) 

P(a), R, Vx(R & P(x», Q(b) — » 3xP(x) 1 v 
R & P(a), Vx(R & P(x)), Q(b) -» 3xP(x) y v 
Vx( R & P(x)), Q(b) 3xP(x) ' 

Q(b), Vx(R & P(x» -» 3xQ(x), Q(b) ^ 

Q(b), Vx(R & P(x» -» 3xQ(x) f & 

Vx(R & P(x)), Q(b) -» 3xP(x) & 3xQ(x) o 
Vx(R & P(x)) -» Q(b) D 3xP(x) & 3xQ(x) 

Vx(R & P(x)) -» Vx(Q(x) D 3xP(x) & 3xQ(x)). 

The -*&-inference as it appears after omitting the boldface formulas can 
be analyzed as consisting of two thinnings T— * and one —>8», thus: 

Vx(R & P(x)) — » 3xP(x) Q(b) -» 3xQ(x) ^ 

Vx(R & P(x)), Q(b) -» 3xP(x) Q(b), Vx(R & P(x)) 3xQ(x) v & 

Vx(R & P(x)), Q(b) — > 3xP(x) & 3xQ(x). 

In G4b, we further allow that the C for an axiom need not be prime. 
Theorems 33 and 36 and Corollary (and Lemmas 6 (a) and 7 with 
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the new postulates included) extend immediately to G4a and (74b, since 
the treatment of the cases for — >T, T— *, —*C, C~> and the liberalized 
axioms is obvious. 

The subfonmda(s) of a formula F are F itself and all the formulas obtain- 
able from it by repeated dissection (by removing one operator after another, 
all the way down to atoms); here, we allow the result of removing Vx 
from VxA(x) or 3x from 3xA(x) to be A(r) Tor any term r free for x in A(x). 

In other words: First, define the immediate subformula(s) of a composite 
formula F to be the formula(s) which can be side formula(s) to F as 
principal formula in any of the 14 logical rules — . . . 3— Then the 
subfonmda(s) of any formula F are F itself, its immediate subformula(s) 
if it is composite, the immediate subformula(s) of each of the latter which 
is composite, etc. 

For example, the subformulas of VaVb(P(b) & Q(a)) are 
VaVb(P(b) & Q(a)), all formulas Vb(P(b) & Q(r)) where r is any term not 
containing b (free), all formulas P(u) & Q(r) where u is any term (with r as 
stated), and finally all formulas P(u) for such u and Q(r) for such r. In par- 
ticular, Q(b) is not a subformula of VaVb(P(b) & Q(a)); cf. Exercise 49.2. 

Combining the definition of “subformula” with our observations early 
in § 49 of ancestral relationships in proofs in (74 (which exist likewise in 
G4a and (74b), we have: 

Ltmma 13. (Ancestor and subformula property.) In a proof in G'4, 
G4a or G4b, in any given sequent'. 

Each formula occurrence is identifiable as an ancestor of a specific formula 
occurrence in the endsequenf, and the former formula is a subformula of 
the latter. 

Each formula part (or whole) is identifiable as an ancestral image of a 
specific part in the endsequenf, the former formula part is identical with the 
latter, or comes from it by free substitutions of terms for free occurrences of 
variables. 

Each occurrence of an operator or predicate parameter is identifiable as 
an ancestor (or ancestral image ) of a specific occurrence of the same operator 
or predicate parameter in the endsequenf. 

In contrast, Gentzen’s system LK or G1 mentioned above does not have 
the subformula property, since the C of a cut need not be a subformula of 
any of the formulas in the conclusion. Likewise, the Hilbert-type system H 
does not have the corresponding subformula property, i.e. not every for- 
mula in a proof is a subformula of the formula proved. For, when we infer 
B from A and A 3 B by modus ponens, A D B is certainly not a sub- 
formula of B, and A is not necessarily one. 

Thus proofs in H generally go outside the “parts” of the formula F 
being proved. We already mentioned this in § 40 as a reason why the 
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decision problems for provability in formal systems like those we had 
considered (of Hilbert type) cannot be solved almost immediately from 
the definition of “provable formula”. But for the propositional calculus 
formalized by G4, we can solve the decision problem this way, in view of 
the subformula property. In fact, in G4 our search procedure must simply 
terminate with outcome (I) or (II) after at most one step along each branch 
for each occurrence of a propositional connective in F. Using G4a, G4b 
or the systems in IM, there is slightly more to the proof of decidability, 
because of the contraction rules — *C, C— >. 263 Of course, for the classical 
propositional calculus, this only gives a new proof of the decidability, with 
a new decision procedure. For the intuitionistic propositional calculus, this 
is how the decidability was first discovered, by Gentzen 1934-5. 263 

This approach does not work on the predicate calculus, because the 
dissection of VxA(x) or 3xA(x) into A(r) gives rise to infinitely many sub- 
formulas. 264 No method can work (Theorem VII § 45). 

A proof in a Gentzen-type subformula system is in a “certain, however 
by no means unique, normal form” (Gentzen 1934-5 p. 177). In it the only 
concepts introduced are ones which enter into its final result and which 
hence must be applied in winning that result. 265 Its result is built up pro- 
gressively out of the components of the result (subformulas); nothing is 
first built up and then torn down. “It makes no detours (Umwege).” 

We shall speak of such a proof as “direct”. 266 

In informal mathematics and in mathematics as formalized in a Hilbert- 
type system, the proofs are not ordinarily “direct” in the present sense. 
As a hypothetical example, suppose we have proved in elementary number 

253 Cf. IM pp. 482-485. 

264 However, this approach is used in IM §80 to show some classically provable 
formulas of the predicate calculus to be intuitionistically unprovable. The intuitionistic 
unprovability is shown by identifying an infinite branch in the sequent tree constructed 
upward using the Gcntren-type system G3. Also cf. Curry 1950. 

m Exception : (individual and) function symbols may be introduced which are not in 
the endsequent, if they are allowed under the formation rules of the system in question. 
But, by our completeness argument (§§ 48-50), any provable (and hence valid) sequent 
can be proved in that system G4 which has only its own (individual,) function and 
predicate symbols: Or simply in proof theory, in a given proof we can replace the 
(individual,) function and predicate symbols not present in the endsequent (or in H, in 
the endformula), as in Theorem 31 § 29 and in the proof-theoretic treatment of sub- 
stitution (Footnote 86 § 25). 

«• We are using “direct” here in a more general sense than at the ends of §§ 13, 25. 
There a proof or method of proof was considered “indirect” when, to prove F, we begin 
by deducing a contradiction from “IF (thus proving “ I - IF, before using — 1-elimination) 
or use some variant of that device. 

In §§ 13 (early), 21, 23 etc., “direct” has still another sense in “direct rule” vs. 
“subsidiary deduction rule”. 

The directness of a proof in G1 without cut compared to GI using a cut is illustrated 
in IM pp. 448-449 Example 1. 
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theory that each prime number has a certain property B. We might put 
this result in a text book as Theorem 1066. Then, later we might apply 
Theorem 1066 to conclude that 7 has the property B. Would this be a 
direct proof (from first principles) of the latter proposition? Hardly. It 
could be very indirect indeed, if the proof of Theorem 1066 involved diffi- 
culties not having to be faced in the case of the prime 7, but perhaps 
alfecting 61 and 2 2 '- 81 — 1. 

If E is the conjunction & . . . & E k of the closures of the axioms 
A,, . . . , A,, of number theory used in proving Theorem 1066, and also 
sufficing to establish that 7 is a prime, the foregoing proof can be formal- 
ized in the predicate calculus. First, ED Vx(Pr(x)D B(x)) (the formula 
expressing Theorem 1066) is proved; then E D Pr(7); and finally, using the 
axiom Vx(Pr(x) D B(x)) D (Pr(7) D B(7)) by the V-schema, and steps in the 
propositional calculus, E D B(7). This would not be a very direct proof of 
I D B(7) m the predicate calculus. 

Whether direct proofs are what we want in a given situation depends 
on our purposes. 

If we have Theorem 1066 already proved anyway, it is much more 
efficient to mh! a short deduction of E D B(7) than to construct a direct 
proof of E D B(7) from first principles. 

In developing mathematics, we usually strive to produce general 
theorems, which we store along the way for later use in particular applica- 
tions or in establishing further general theorems. This is why the Hilbert- 
type systems lend themselves well to the formalization of mathematics as 
it is actually developed. When we are working in number theory, we would 
not carry the “E D” as in the above example, but would simply include 
A,, .... A, among the axioms of the system (like N of § 38). 

Nevertheless, i( was a major logical discovery by Gentzen 1634-5 that, 
when there is any (purely logical) proof of a proposition, there is a direct 
proof.-' 17 The immediate applications of this discovery arc in theoretical 
logical investigations, rather than in building collections of proved 
formulas.- 0 '' - 

I lerbrand 1 930 showed that any provable formula of the predicate calculus can be 
proved without using modus ponens in the part of the proof employing quantifiers 
(§ 55). Gent/en reached his "Hauptsatz” or ‘'normal form theorem” by following his 
own direction of investigation, not using this result of Herbrand. Gcntzcn 1934-5 is 
crystal clear, while Herbrand 1930 has only recently been fully elucidated. 37 '' 

But Gentzen’s work, and related work of others, has a practical side. We have 
seen this in connection with our format (B 2 )end § 13. This can be obtained by taking 
a suitable Gcnt/en-iypc sequent system, and using the sequents without each time 
writing the formulas of the antecedent. Thus Gentzen 1934-5 obtained “natural 
deduction systems". Jaskowski 1934 introduced such systems directly, at the suggestion 
of Lukasiewicz in seminars in 1926. Beth’s semantic tableaux are closely related, 331 
But wc have preferred in our practice of logic to operate flexibly using derived rules, 
rather than to freeze our practice into a particular such system. Cf. § 25, Prawitz 1965. 
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Consider any (consecutive) part A of a given formula B not containing 
~ (or at least not containing ~ with A in its scope), this part being a 
formula, an operator or a predicate parameter. As in Theorem 24 (for a 
formula part), we say A is a positive or negative part of B, according as A 
stands in the D of an even or odd number of parts of B of the forms ~iD 
and DDE (where D and E are formulas). For example, in 
3x(P D Q(x)) D (P D 3xQ(x)), the first D is negative, the other two positive; 
the first P is positive, the second negative; and the first Q(x) is negative, 
the second positive. 

Now if we take B as one of the formulas A in a sequent A — » A, the 
positive and negative parts of B are considered as Positive and Negative 
parts of the whole sequent A — > A; but if B is one of the formulas A, we 
reverse these Signs. Equivalently, a part A of a sequent A — » A not con- 
taining ~ is Positive or Negative according as it occurs in the dotted posi- 
tion relative to an even or odd number of the symbols . . . , . . . D, 
To emphasize the difference between these notions applied to A 
considered as a part of a formula B and considered as part of a sequent 
A — » A having B as a succedent or antecedent formula, we are capitalizing 
in the latter case. For example, in 3x(P D Q(x)) P D 3xQ(x), the first P 
is negative (as a part of 3x(P 3 Q(x)) but Positive (as a part of the whole 
sequent). 

Lemma 14. (Sign property.) In a proof in G4, G’4a or (74b of a sequent 
A —* A not containing each image of a Positive part of A — > A is Posi- 

tive, and each image of a Negative part of A — > A is Negative. 

This may be observed in our examples, and verified as true in general 
by examining the 12 logical rules other than — and ~ *, and for (74a 
or (74b the 4 structural rules. 

By Lemma 14, it is predetermined by the Sign of each formula part in 
a sequent A — * A not containing ~ whether, as the rules are used upward 
from A — » A, that part can have an image which is a succedent formula 
(the Sign is Positive) or an antecedent formula (the Sign is Negative). 269 

The two rules — and 9 are unlike the others in that, with A ~ B 
as the principal formula on a given side of the arrow, we get two occur- 
rences of A as side formula one on each side of the arrow, and likewise of B. 

If wc start with a logical problem stated using ~, we can apply the 
foregoing notions of “Sign” etc. after replacing each part A ~ B by 


26# The subformulas of a given formula without ~ as a succedent or antecedent 
formula are accordingly classifiable as succedent subformulas (Positive) and antecedent 
subformulas (Negative). Cf. Kleene 1952 pp. 10-11, which the present discussion 
(including Lemma 14) follows in substance. The terms “ancestor”, "descendant” 
and “image” are from that paper. “Subformula” should have been defined there as in IM 
or here (as noted by V. P. Orcvkov); this is corrected in the 1967 edition. 
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(A 3 B) & (B 3 A), using *63a in Theorem 2. Equivalently, we can apply 
the foregoing discussion directly, by considering each formula part within 
the scope of exactly n occurrences of ~ as having “multiplicity” 2"; we 
imagine 2" replicas of the same part superimposed, 2"~ I of them Positive 
and 2“ 1 of them Negative. 

Exi.KC'isrs. 54. 1 . Show that in G4a and G4b, without changing the class 
of provable sequents the axiom schema can be simplified to C — *• C, the 
two rules &— ► and — »V can be simplified by omitting either side formula 
(producing four rules), and the two. rules V— » and - >3 can be simplified 
by omitting the principal formula in the premise. (From G4b, this gives a 
system Ga differing from G of Klcene 1952 only in its having the two 
~-rules. In IM and Klecne 1952, ~ is not a primitiye symbol.) 

54.2*. Show that, if F F in //, then there is a proof of F in which there 
occur only 3, ~t and those of &, V, • — -, V, 3 which occur in F, in the system 
// or (if V but not & occurs in F) in the extension of H having the axiom 
schema Vx(C 3 A(x)) 3 (C 3 VxA(x)) where C does not contain x free. 
(IM Theorem 49 p. 459.) 

*§55. Permutability, Herbrand’s theorem. Often logicians are 
concerned with the conditions under which certain kinds of formulas or 
sequents are provable. 

Since proofs in the systems G4, G4a and G4b are ‘‘direct” (§ 54), it is 
easier to analyze their structure and thus to extract information from the 
supposition that a proof exists in these systems than in systems like H. 
So a logician may begin an investigation by using the'Gentzen theorem to 
infer that, if a formula F is provable in H (or is valid), the sequent — > F 
is provable in G4, G4a or G4b. 

In a proof in G4, G4a or G4b, we say a given logical inference belongs to 
a given operator occurrence in the endsequent, if the principal operator 
of the inference is an ancestor of the operator occurrence. Using Lemma 
13, a proof in G4 consists entirely of inferences belonging to the various 
operator occurrences in the endsequent, in some order. In G4a or G4b, 
there may also be structural inferences. By Lemma 14, without ~ only a 
specific one of the 12 kinds of logical inferences other than — and 
can belong to a given operator occurrence in the endsequent, namely an 
inference by the succcdent or antecedent rule for that operator according 
as the occurrence is Positive or Negative. 27 " 

For example, a proof in G4, G4a or G4b of the sequent 


-> 3,x(P 3, Q(x)) 3, (P 3, f 3,xQ(x)) 


' 11 " The method of adapting the discussion to the presence of ~ was indicated in the 
final paragraph of S 54. 
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(where we now number the operators for reference) consists of inferences 
3 *-*, - 0 5 , - 0 4 , — > 3 3 

belonging to the respective operators (as indicated by the matching num- 
bers), besides possibly structural inferences. 

In general, there may be zero, one, or more than one inference belonging 
to a given operator occurrence in the endsequent. 

To complete our discussion of Gentzen’s “normal form” for proofs, it 
remains for us to consider the order in which the logical inferences 
belonging to the various operator occurrences are assembled (along with 
structural inferences). 

We shall find that, in proofs in G4a or G4b with the extra freedom that 
the structural inferences give us, we can choose the order of the inferences 
to suit our purposes, within certain limitations (stated below). Thus by a 
rearrangement or “permutation” of the logical inferences in a given proof, 
we may be able to bring it to a form from which we can see more easily 
what is happening. 

Example 8. Suppose that the two inferences — >V and — >& shown 
below at the left occur as part of a proof in (74a or (74b. The principal 
formula of the upper inference is not a side formula of the lower one. This 
part can be replaced, without spoiling the whole as a proof in <74a or (74 b, 
by the figure shown at the right with the — »& over the — >V. Briefly, the 
can be lifted over the — »V, or “interchanged” with it. 

r-»Q,P(b),Q 

r -> 0, VxP(x), Q v r -*■ 0, VxP(x), R(a) _ 
r -> 0, VxP(x), Q & R(a). 

r -> 0, P(b), Q r -» 0, VxP(x), R(a) 
r — » 0, P(b), VxP(x), Q & R(a) 

T -» 0, VxP(x), VxP(x), Q & R(a) 
r — ► 0, VxP(x), Q & R(a). 

In Example 8, we could not have simply interchanged the -*& with the 
— >V, if the upper figure had had P(a) instead of P(b). For then the restric- 
tion on variables for the -*V (§ 48) would not be met after the interchange. 

Such difficulties will not occur in a pure variable proof, i.e. a proof in 
which no variable occurs both free and bound and in which the b of each 
— »V or 3— » occurs only in sequents above its conclusion. (If the A(x) does 
not contain the x free, we can pick the b for the analysis toward meeting 
this condition.) 

In (74, (74a or (74b, a given proof of a sequent A — > A containing no 
variable both free and bound can be changed to a pure variable proof of 
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the same sequent, simply by changing occurrences of variables in it to 
occurrences of other variables. We establish this Pure variable lemma 
thus. 

Consider a given proof of such a sequent A — * A. By Lemma 13 (the 
last part), every variable occurring bound anywhere in the proof occurs 
bound (and hence by the hypothesis, not free) in the endsequent A — > A. 
Each of these variables that occurs free (elsewhere) in the proof can be 
changed in all its free occurrences to a different variable not previously 
occurring in the proof. Now, with each — »V- or 3— ^-inference in the 
resulting proof of A — ► A, we associate a family of sequents, as follows. 
If the A(x) of the inference does not contain the x free, the family is empty. 
Otherwise, the family consists of the premise of the inference and all other 
sequents containing the b free which can be reached from the premise by 
consecutive steps in the tree through such sequents. By the restriction on 
variables for the rules — >V and 3— >, such a family does not include the 
conclusion of the inference; and two such families with the same variable 
as the b do not overlap (or the restriction would be violated for the upper 
inference). Now to get a pure variable proof of A -*• A, we need only 
consider the families whose b occurs elsewhere in the proof, and change 
the b throughout each such family to a different variable not previously 
occurring in the proof. 271 

Example 9. The following illustrates lifting a 1 -premise inference over 
a 2-premise inference. 

r->6, P(b), Q r -» 0, P(b), R(a) 

T -> 0, P(b), Q & R(a) 

!'-»<:>, VxP(x), Q & R(a) 

r->0, P(b),Q r-»0,P(b), R(a) 

r -> 0, VxP(x), Q r -» 0, VxP(x), R(a) 

r 0, VxP(x), 0 & R(a). 

If this interchange is performed in a pure variable proof, the result is not 
a pure variable proof. But it can be made one just by changing some 
variables, according to the pure variable lemma; indeed, we need only 
change all the b’s over one of the two new — »V’s to a variable c not 
previously occurring in the proof. 

Example 10. Consider again the sequent 

“>3 2 x(P O l Q(x)) 3 5 (P 3 4 3 3 xQ(x)). 

271 Our search procedure in §§ 49-50 will lead to a pure variable proof (when (II) 
holds), if it is modified in one respect. Whenever a branching (2-premise inference) 
occurs, we then split the part of the list a 0 , a,, a a , . . . not yet activated into disjoint lists, 
one to be used with one premise and one with the other. 
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How much freedom do we have to choose the order of the inferences 
3*-*» — °5> ~°4, ~*3 3 

in proving it? One of the permissible orders (— >3 3 , Oj— ^ ►, 3 2 -*, — »n> 4 , — Oj 
reading down) is given in Example 2 § 48. Clearly, the inference(s) —* 0 5 
must come below all the others. For, the other operator occurrences in 
the endsequent are within the scope of Z) 6 ; so the inferences used to 
introduce those operators must come above in the process of building up 
the side formulas for -h O s . For the same reason, each 3 2 — ► must come 
below each — O x ; and — s 0 4 must come below — »3 3 . There is one other 
restriction. If in Example 2 we first lift ► over — »3 3 so that 3 2 — » comes 

just under —►3 3 , then we cannot next lift 3 a — » over — »3 3 . For if we did, 
the restriction on variables for 3 a — > would be violated, because a 0 would 
be free in its conclusion. 

Example 10 illustrates the only two obstacles to interchanging adjacent 
logical inferences, i.e. ones with at most structural inferences between. 
It can be verified that the following interchange lemma holds: When 
neither of these obstacles is present, two adjacent logical inferences can 
always be interchanged in a pure variable proof in G4a or G4b, preserving it 
as a pure variable proof of the same sequent . 272 

Now we formulate what this allows us to do to a proof as a whole. It 
should be reasonably evident that we can get the following result, by a 
finite number of successive interchanges of adjacent logical inferences. 
(If persons of several families are in a line, the families can be brought 
together by a finite number of interchanges of adjacent people in the 
line.) 

Let the operator occurrences in the endsequent A—* A ( without ~) of a 
pure variable proof in (74a or G4b be put into q classes C x , . . . , C, ; from 
“ highest ” to “ lowest ” so that (1) each operator occurrence within the scope 
of another is in the same or a higher class than the latter, and (2) each 
operator occurrence of the kinds V— » and —■ >3 is in the same or a higher 
class than any operator occurrence of the kinds —»V or 3— ►, unless the 
latter occurrence is within the scope of the former. Then the proof can be 
rearranged ( preserving it as a pure variable proof of A — * A) so that along 
any path ( zero or more) inferences ( belonging to operator occurrences) of 
Ci come highest, of C 2 next, . . . , of C „ lowest. This is the permutability 
theorem (Kleene 1952). 

We apply this to a pure variable proof in G4a (obtained if necessary 
from a given proof by using the pure variable lemma) of a sequent A — * A 

s,a In Kleene 1952, the interchange lemma is established as Lemma 7 for a system 
O' diHering mcsscnlially from G4b (cf. Exercise 54.1), using an exhaustive classification 
of the cases, and the permutability theorem (next) is proved thence as Theorem 2. 
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whose formulas are all prencx (§25). Let C, be all the propositional 
operators and C., all the predicate operators in A— »A, Because the 
formulas of A— ».\ tire prencx, (1) is satisfied. Because all predicate 
operators are put together, (2) is satisfied. So the proof can be rearranged 
to put all the propositional inferences above all the predicate inferences. 271 ’ 
Reading up, no branching can occur until propositional inferences are 
reached. Now consider the sequent which is the premise of the highest 
predicate inference. It may contain formulas with quantifiers (necessarily 
at the front, since its formulas are subformulas of prenex formulas). But 
no ancestor of one of these formulas plays the role of a principal formula 
higher up (since all predicate inferences are below). Hence, since in G4a 
the C’s for axioms arc prime, no ancestor of one of these formulas plays 
the role of the C in an axiom. 273 So if all the quantified formulas are 
simply omitted from the sequent in question on up, and any resulting iden- 
tical inferences (with premise and conclusion the same) arc suppressed, 
we get a proof of the resulting sequent. From it by thinnings we can 
continue to the sequent in question. Thus: 

Gircn an r proof in G'4a of a sequent A — > A containing only prenex 
formulas , and containing no variable both free and bound, a pure variable 
proof in 6'4a of A — > A can be found in which there is a sequent (called the 
midsequent) containing no quantifier, with only propositional and structural 
inferences above it, and only predicate and structural inferences below 
it. 2 '* This is Glnizi.n’s sharpened (or extended) Hauptsatz 1934-5. 275 

The significance of this result appears when we reflect on the rules by 
which the part of the proof from the midsequent down is conducted. 

'- 7J For the approach via Gcntzen’s Hauptsatz, 20 " it is necessary to proceed from 
(,'4b toward G'4a only far enough to obtain quantifier-free axioms. (Cf. IM p. 461.) 

271 It is not hard to see that thinnings — *T and T-* can be avoided from the mid- 
sequenl down (Lxercise 55.3: solution in Kleenc 1952 Lemma 4). 

2,r ‘ Gont/on’s “verscharfter Hauptsatz” (but proved entirely mctamathematically) 
is a convenient starting point for consistency proofs for various systems weaker than 
N. Cf. § 54 Paragraph 4, and IM § 79. 

Gentzen's sharpened Hauptsatz requires only the part of the permutability theorem 
by which propositional inferences can be moved up through predicate inferences whose 
principal formulas are not side formulas for the propositional inferences. 

However, it has required only a little more space to formulate the more general 
permutability theorem (though of course more cases have to be checked in proving it). 
Gentzen 1934-5 p. 412 hinted at such a permutability theorem, without exploring the 
possibilities. Also cf. Curry 1952. 

The classical and intuitionistic versions of the permutability theorem were used by 
Klecne 1952a in a rather complex consistency argument. (Lemma 9 is the proposition 
of Footnote 107 §29, established metamathematically for both the classical and the 
intuitionistic systems.) This exercise convinced the author that Gentzcn-type systems in 
essentially Gentzen’s own form arc a very effective tool for such purposes. 
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To give an illustration, suppose the formation rules in use provide 
(besides predicate parameters or ions, and variables) just one individual 
symbol x and one 1-place function symbol X. 27,i Suppose A(w, x, y, z) is a 
formula containing no quantifiers. Suppose that f 3wVx3yVzA(w, x, y, z) 
in H, and that after applying Gentzen’s theorem and his sharpened 
Hauptsatz we have a pure variable proof in (74a of 
— » 3wVx3yVzA(w, x, y, z) in which the part from the midsequent down 
is as follows, with the variables a, b, c, d, e, f, w, x, y, z distinct from each 
other and from all other variables (if any) in A(w, x, y, z). 


A(X(b), d, x, e), A(a, b, X(c), f), A(a, b, b, c) 

— > A(X(b), d, x, e), VzA(a, b, X(c), z), A(a, b, b, c) 

-> A(X(b), d, x, e), 3yVzA(a, b, y, z), A(a, b, b, c) 

-> VzA(X(b), d, x, z), 3yVzA(a, b, y, z), A(a, b, b, c) 

~> 3yVzA(X(b), d, y, z), 3yVzA(a, b, y, z), A(a, b, b, c) 
-> Vx 3yVzA(X(b), x, y, z), 3yVzA(a, b, y, z), A(a, b, b, c) 
— > Vx3yVzA(X(b), x, y, z), 3yVzA(a, b, y, z), VzA(a, b, b, z) 
-» Vx3yVzA(X(b), x, y, z), 3yVzA(a, b, y, z) ^ 

— »3wVx3yVzA(w, x, y, z), 3yVzA(a, b, y, z) 

— > 3wVx3yVzA(w, x, y, z), Vx3yVzA(a, x, y, z) 

-* 3wVx3yVzA(w, x, y, z). 

Thus the midsequent is 

(i) -> A(X(b), d, x, e), A(a, b, X(c), f), A(a, b, b, c). 


Certain structural details which are significant here do not stand out 
clearly. We shall now see how (i) can be altered to a more revealing form 
(though it will no longer be a midsequent), by introducing symbols for 
functions called “Herbrand functions” or “Skolem functions”. 277 

We begin by augmenting the symbolism of our Gentzen-type system 
G4a by two new function symbols fJ( — ) and 8( — , — ). We consider 
(3( — ) and 8( — , — ) as corresponding to the universal quantifiers Vx and 
Vz in the formula of the endsequent, noting that it is a succedent formula. 
(For an antecedent formula, we would introduce function symbols 
corresponding to the existential quantifiers.) 

1,1 Since we are forced here to use such a large part of the Roman lower case alphabet 
“a”, “b”, “c”, . . ., “x”, "y”, “z” as names for variables, we are now using lower case 
Greek letters “a”, “(3”, “y”, ... as names for function symbols (instead of “f”, “g”, 
“h”, ... as in § 28 etc.). 

277 Such functions were used by Herbrand 1930, and earlier by Skolem 1920, 1922-3, 
1929 and implicitly by Lowcnheim 1915. Cf. Footnote 252 Paragraph 3. 
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Now wc take the fragment of a pure variable proof shown from end- 
sequent up to midsequent, and perform on it a series of 5 substitutions, 
one to each — »V-infercnce. (More generally, we would do the like with 
all ~»V- and 3— ^-inferences.) We begin with the lowest — >V, in which the 
variable b (not occurring below its premise) is introduced in the side 
formula 3y VzA(a, b, y, z) as we read the proof-fragment upward. Through- 
out the fragment, for all the 31 occurrences of b we substitute the term 
[3(a). Of course, this spoils the fragment as a part of a proof in G4a 
(actually, it spoils only the — »V under consideration); but we don’t 
care. We are using the fragment, and successively altering it, as a way of 
discovering a new sequent (ii), having a more revealing form than the 
midsequent (i). Now we take the next — >V (reading up) in the already 
once-altered fragment; its altered side formula is A(a, (3(a), (3(a), c). For 
all the 8 occurrences of c (which are only from there up) we substitute 
8(a, (3(a)). We treat the remaining three — »V’s (in the now twice-altered 
fragment) in succession, similarly. That is, when the side formula of an 
— »V has become 3yVzA(r, v, y, z) for some term r and variable v, we 
substitute for all v’s the term (3(r); when it has become A(r, s, t, v) for 
some terms r, s, t and variable v, we substitute for all v’s the term 8(r, t). 
When the succession of 5 substitutions has been completed, the mid- 
sequent has become 

( ii) -+ A(X((3(a)), (3(A((3(a))), x, 8(X([3(a)), x)), 

A(a, (3(a), X(8(a, (3(a))), 8(a, X(8(a, (3(a))))), 

A(a, (3(a), (3(a), 8(a, (3(a))). 

The structure exhibited in (ii) provides a “history” of the steps in G4a 
by which we proceeded upward from the endsequent 
— * 3wV.\3yVzA(w, x, y, z) to the midsequent (i). From the history 
exhibited in (ii), we can rediscover (i) and the steps from (i) down to 
— > 3wVx3yV/A(w, x, y, z), apart from inessential ^details (Exercise 55.1). 
From any similar history, we can do the like. This gives a preview ol what 
we will do presently. 

Meanwhile, let us examine the result we thus far have. 

Imagine that at the beginning of the substitutions we had before us, 
not just the fragment from the midsequent (i) down, but a whole proof in 
G4a ol' —* 3wVx3yVzA(w, x, y, z) as described in Gentzen’s sharpened 
Hauptsatz. The part from the midsequent (i) up consists of propositional 
and structural inferences and axioms. Suppose now that the substitutions 
of (3(a) for b, of S(a, (3(a)) for c, . . . were carried out on the whole proof, 
including the part above the midsequent. Those substitutions do not 
invalidate the applications of the 10 propositional rules or the 4 structural 
rules of G4a, or of the axiom schema (x). So, whereas originally the 
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part above the midsequent was a proof of (i) in the system of propositional 
calculus G4a (i.e. the predicate calculus G4a minus the 4 predicate rules), 
after the substitutions it has become a proof of (ii) in the propositional 
calculus G4a (but with the symbolism augmented by (3, 8). So by Theorem 
33 the sequent (ii) is valid, and hence by Exercise 50.1 (b) the formula 

A(X((3(a)), p(X(P(a))>, x, 8(X(p(a», x)) V 
A(a, (3(a), X(8(a, (3(a))), 8(a, X(S(a, p(a))))) V 
A(a, (3(a), (3(a), 8(a, (3(a))) 

is valid, i.e. is a tautology under the truth tables of the propositional 
calculus § 2 (or equivalently by §§ 1 1, 12 is provable in it § 9). 278 

The “only if” part of the following proposition is obtained by recogniz- 
ing that for any proof of — » 3wVx3yVzA(w, x, y, z) as described in 
Gentzen’s sharpened Hauptsatz, the result of our substitution method on 
the part from the midsequent down must lead to a valid disjunction of 
the form exhibited; i.e. our (i) and (ii) are “typical”. 

A prenex formula F of the form 3wVx3yVzA(w, x, y, z) ( with all its 
quantifiers shown) is provable in the predicate calculus H if and only if 
some disjunction of the form 

A(t n , P(tn), t 12 , 8(t u , tia)) V ... V A(t ( i, (3(tu), t I2> 8(t tl , t (2 )) 

(a Herbrand disjunction) is valid (or equivalently, is provable) in the propo- 
sitional calculus H. Here t u , . . . , t (2 are terms constructed from variables, 
the ( individual ,) function and predicate symbols of F, and the additional 
l-place function symbol (3 and 2-place function symbol 8. This is a version 
of Herbrand’s theorem 0 1930, illustrated by the case of 
3wVx3yVzA(w, x, y, z). More precisely, it is one of the propositions 
which occur in the literature as partial versions of or as included in the 
“fundamental theorem” of Herbrand’s thesis 1930. Another such version 
coincides with Gentzen’s sharpened Hauptsatz (with Footnote 274) as 
applied to — > F. There is still more to the theorem, however. 27 " 

2,8 Alternatively, we can observe that Theorem 36 and Corollary hold for the prop- 
ositional calculi G4a and H; i.e. their proofs require the postulates of the predicate 
calculus in H only when the given deduction in G4a uses the predicate rules — >V, . . . , 
3 — >. Applying Corollary (c) to (i), A(X(b), d, «, e) V A(a, b, X(c), f) V A(a, b, b, c) is 
provable in H, and hence by Theorem 12 valid. Finally, we can observe that the sub- 
stitutions of 3(a) for b, of 8(a, 3(a)) for c, . . . simply effect a substitution for the atoms 
in the sense of Theorem 1 § 3. 

2,8 Another version, in Hilbert and Bernays 1939pp. 163-178, follows from the present 
treatment without adding any major ideas. These several versions all apply to a prenex 
formula, while Herbrand stated and attempted to prove his theorem for an arbitrary 
formula. 

Herbrand died in 1931, aged 23. Gentzen 1934-5 and Hilbert and Bernays 1939 gave 
clear treatments of the versions mentioned, proceeding from two other directions than 
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We still have to prove our “if” part. 280 So assume that some Herbrand 
disjunction as displayed is valid. We must show that then 
3wVx3yVzA(w, x, y, z) is provable in H. There is no loss of generality in 
supposing the / disjunctands to be distinct, since if they were not we 
could suppress the duplications and still have a tautology. Likewise, we 
can assume all variables in the Herbrand disjunction to be distinct from 
w. x, y, z; for otherwise, using Theorem 1 § 3 we could make some 
changes in them to secure this. 

Consider the distinct terms of the forms (3(s) (s a term) and 8(s, t) 
(s. t terms) occurring in the Herbrand disjunction (or if w, x, y, z do not 
all occur in A(\v, x, y, /.), occurring in the terms shown in the expression 
for the Herbrand disjunction displayed above). List them as 

t<)> * * ■ > 

in such an order that those terms occurring as parts of s ( including possibly 
s itself) precede 3(s). and 3(s) and those terms occurring as parts of t 
(including possibly t itself) precede S(s, t).We can certainly list them in such 
an order. 1 or, we could enumerate all the terms constructible using the 
variables, individual and function symbols of the Herbrand disjunction 
(including (i, 5) by the method of digits with an alphabet in which (3 pre- 
cedes t> ((A) § 32). The p+ 1 terms t 0 , . . . , t„ under consideration would 
occur in that enumeration with the required kind of order. 

Next, to the terms t 0 , . . . , t„ we correlate respective variables 

a 0 , . . . , a„ 


Herbrand. These are the present writer's sources. Hilbert and Bernays (1939 p. 158) 
wrote, "Merbrand’s argumentation (Beweisfuhrung) is difficult to follow”. In 
1963 Dreben, Andrews and Aanderaa established that two of Herbrand’s lemmas are 
false. It is only in the translation and recension by Dreben and van Hcijenoorl in van 
llcijenoort 1967 that Herbrand’s text has become accessible without undue eirort. 
Denton and Dreben 1968 workout the main ideas of Herbrand’s thesis, obtaining among 
other things a connection between modus ponens (or cut) elimination and the consistency 
of number theory which is absent in Gentzcn's consistency proofs 1936, 1938a. 

ssl ’ The proof of Godel’s completeness theorem in Kleene 1958 leads directly to 
(the present part of) Herbrand’s theorem as the basic discovery (instead of the complete- 
ness of the Gcntzen-type system G4). This it does by refining the easy axiom-of-choice 
proof of the Lowenheim-Skolem theorem (Footnote 252 Paragraph 3) to pick truth 
values of atoms toward building a satisfying assignment if there is one (Footnote 252 
Paragraph 5). In this it is very similar to the proofs of the basic discovery in Skolem 
1922-3 and 1929 (of which Kleene was unaware in 1958). The rest of the proof (in 
Kleene 1961) is the present proof of the “if” part of Herbrand’s theorem. 

The proof in IM pp. 389-393 is quite similar, though it is presented rather differently. 
It was written immediately after the appearance of Henkin 1949, adapting an idea 
from that to the proof of the first lemma. 
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distinct from each other, from all the variables in the Herbrand disjunction, 
and from w, x, y, z. 

For any term r in the Herbrand disjunction, let r come from r by 
replacing each maximal (consecutive) part of r of the form (3(s) or 
8(s, t) (s, t terms) by the correlated variable. A maximal such part is one 
not constituting a proper part of another such part. By the theory of 
proper pairing cited in § 38, it can be seen that any two distinct maximal 
such parts are nonoverlapping. 

The Herbrand disjunction (assumed to be valid) will remain so when 

each of its atoms P(r x , . . . , r„) is replaced by P (r x r„). For, like 

atoms will receive a like alteration, which makes the operation a sub- 
stitution in the sense of Theorem 1 § 3. So t= D in the propositional 
calculus H, where D is the resulting disjunction. 

But (3 and 8 are function symbols not in A(w, x, y, z). Hence the 
operation of replacing maximal parts |3(s) and 8(s, t) by the respective 
variables takes place entirely within the shown terms t ll; (3(t a ), t i2 , 
8(t n , t l2 ) (/ = 1, . . . , /) which are substituted for w, x, y, z; i.e. none of 
the shown terms is part of a larger term in a disjunctand 
A(t„, P(t (1 ), t (2 , 8(t„, t, 2 )) which larger term gets replaced in toto. So 
the result of the replacement operation on the /th disjunctand can be 

written Aft,^, P(t„), t, 2 , S(t a , t l2 )). 

Now cither by the completeness of the propositional calculus H 
(Theorem 14 § 12) and the Gentzen theorem (§ 54) and — >V used upward 
/—I times (as the permutation theorem allows), or directly by reasoning 
in §§ 48 and 49, the sequent shown at the top of the next figure (written 
with / = 2 to simplify the notation from this point on) is provable in 

G4a. 

-» A(tTi, P(tn), W 8(t„, t L2 )), A(t 81 , P(t 81 ), t 22 , 8(t n , t, 2 )) 



348 


THU PREDICATE CALCULUS (ADDITIONAL TOPICS) CH. VI 


So the “rightmost (free) variable” &(t„, t l2 ) in the /'th formula of the top 
sequent (it is a variable, because S(t ; ,, t, 2 ) is a maximal part 8(s, t)) comes 
later in the list a u , . . . , a„ than any other variable in that formula. So 
among the rightmost variables §(t n , t 12 ), S(t 21 , t 22 ) (which are distinct 
since (t u , t 12 ), (t 2 „ t 22 ) are), one comes furthest out* in the list a„, . . . , a p 
(indeed it must be aj; to fix the notation, say this one is 8(t u , t 12 ). All 
variables in a 0 , . . . , a„ arc distinct from the other variables in the sequent 
(i.e. the variables if any which were in the Herbrand disjunction and were 
not removed in the replacement of maximal parts (3(s) and &(s, t)), and 
from w, x, y, z (which do not occur in the sequent). Thus 8(t u , t 12 ) occurs 
in the top sequent only as shown, and w, x, y, z do not occur. Hence the 
requirements are met for an application of — >V, with the sequent shown 
second (reading down) as conclusion. In this sequent, w, x, y, z occur 
only as shown, and thus no variables are bound in the parts t u , (3(t n ), 
t 12 , t S |, p(t 21 ), t., 2 , 8(t 2 i, t 22 ). Using this fact (with respect to t 12 ), we can 
apply — >3 in (74a (where the principal formula need not appear in the 
premise) to t, 2 (which we consider as “uncovered” by the conversion of 
8(t n , t 12 ) to z) to obtain the third sequent (reading down). 

By reasoning as before, f}(t u ), 8(t 21 , t 22 ) are variables further out in the 
list a 0 , . . . , a„ than other variables in their respective formulas in the 
third sequent; and so one of them is furthest out of all (indeed it is a p -i), 
and occurs only as shown (as do w, x, y, z). Say e.g. this one is S(t 2l , t 22 ). 
Then by — we can infer the fourth sequent, with w, x, y, z occurring 
in it only as shown. While we had that (t„, t 12 ), (t 21 , t 22 ) are distinct 
pairs of terms, tu and t 2X are not necessarily distinct terms. Say e.g. they 
are not distinct. Then in the — >3 which we can now perform since t> 2 is 
uncovered, the first formula of the premise is the principal formula, so we 
get the fifth sequent as shown (or we could first use — >3 without regard to 
t n and t 21 being the same, and then — >C). 

Now an —»V and — >3, each clearly legal, lead to the bottom sequent. 

Thus this sequent is provable in (74a. By the Gentzen theorem (or 
specifically Theorem 36 Corollary (a)), 3wVx3yVzA(w, x, y, z) is provable 
in the predicate calculus H, as we aimed to show. 

We used 1 = 2, and an illustrative series of assumptions. It should be 
clear that the process can always be carried out. Another illustration is 
provided by starting from (ii) with bars placed over the terms (Exercise 
55.1). 

We may summarize Herbrand’s theorem by saying that it reduces the 
question of the provability of a particular formula with quantifiers (in 
the first instance, a prenex formula) to the question of the validity (or 
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provability) in the propositional calculus of some one of a countably 
infinite class of quantifier-free formulas (the Herbrand disjunctions). 

Among the applications of Herbrand’s theorem are those in Hilbert and 
Bernays 1939 pp. 178 ff., in Kreisel 1958, and in Denton and Dreben 1968. 

Exercises. 55.1. Apply the method of the “if” part of the proof of 
Herbrand’s theorem to (ii), as follows. Take for t 0 , . . . , t„ the list 
[3(a), 8(a, (3(a)), (3(X((3(a))), 8(X(|3(a)), *), S(a, X(8(a, [3(a)))) (verify that it 
meets the requirements). Find steps by the method, leading downward in 
(74a to — * 3wVx3yVzA(w, x, y, z). This can be done without specifying the 
variables a 0 , . . . , a„. Now take a 0 , . . . , a p to be b, c, d, e, f; and compare 
your result with the steps originally shown as leading up from 
-* 3wVx3yVzA(w, x, y, z) to (i). 

55.2. Formulate a condition for each of the following, similar to that 
stated for V H 3wVx3yVzA(w, x, y, z) in our illustration of Herbrand’s 
theorem. 

(a) l- w ~i3wVx3yVzA(w, x, y, z) (s f ( , 4a 3wVx3yVzA(w, x, y, z) — >). 

(b) 3wVx3yVzA(w, x, y, z) is satisfiable. 

(c) V I{ VvB(v) & 3rVsVt3uC(r, s, t, u) 3 3wVx3yVzA(w, x, y, z). 

55.3*. Prove the proposition of Footnote 274. 

§ 56. Craig’s interpolation theorem. In this section, we utilize the 
relationships which are present in proofs in (74 or (74a to establish (as 
Theorem 41) Craig’s interpolation theorem or Craig’s lemma 1957, 1957a, 
including a version derived from Lyndon 1959. Until Theorem 42, we 
shall deal with the predicate calculus without equality (without or with 
functions). 281 The main part of our work is put into proving Theorem 40. 

The idea is this. Given a proof in (74 or G4a of E —► F, we can split it 
vertically into two parts: the E -part we obtain by omitting in each sequent 
all the ancestors of the F in the endsequent E— ►F; the E-part y by 
omitting all the ancestors of the E. Of course, the E-part and the F-part 
won’t in general be proofs. But now we ask whether we cannot repair each 
part, or at least one of them, to make it a proof, utilizing at least some 
of its essential anatomy. We aim to make the repair by restoring a 
minimum of what was cut away from the part in the splitting operation 
that cannot be spared for a proof, and by removing what is superfluous 
and in the way in the absence of the other part. 

Clues to what we can do toward restoring the E-part or the F-part to 
become an E -proof or an E-proof will be provided by considering what 
happens to the axioms of the given proof in the splitting operation. We 
call an axiom C, T — > 0, C in the given proof of E — * F an EF -axiom if 

281 The symbol = may be among our predicate symbols, but it is not to have a special 
status (as it did in § 29); i.e. it is to be counted as a predicate parameter, and no axioms 
are postulated for it (until Theorem 42). 
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one of its C’s is an ancestor of E and the other of F; an E -axiom if both are 
ancestors of E; an F -axiom if both are ancestors of F. We have been 
assuming throughout our theory of the relationships within proofs in (74 
or (74a that each proof is given with an analysis which determines the role 
of each formula occurrence in each step. 229 

1 irst, consider an EF-axiom C, F — » (-), C, e.g. with the first C belonging 
to E and the second to F. Under the splitting operation, the E-part 
receives from this axiom C, P B -*C-) K , where P K are the ancestors of E 
among the formula occurrences F and fc) K those among 0. Similarly, the 
1 -part receives P K — » 0,,, C. Toward repairing the E-part to become a 
proof, we evidently should put the removed C back on the right (under- 
lined) to obtain C, r B — »0 K , C, and toward repairing the F-part on the 
left to obtain C, P F — »0 K , C. This of course is just a first step toward 
repairing the two parts. But, when the original proof has only EF-axioms, 
the tops of all the branches in each part can be repaired in this manner. 
Then, as we shall see in proving Theorem 40, we can continue the 
restoration down through the two parts, performing a minimum of 
corresponding steps to combine the restored C’s from the various EF- 
axioms into a single formula I (the interpolated formula). In this manner, 
we can make the E-part into an E-proof of E— M and the F-part into an 
E-proof of I —* F. Here the 1 appears on opposite sides of the arrow in 
the endsequents of the E-proof and the F-proof, as did the restored C’s 
at the tops of branches. It is convenient to construct the restored proofs 
in 64a (§ 54), even if the original proof is in (74. 

Now consider an E-axiom C, P — ► 0, C. In the splitting operation, the 
E-part receives from this axiom the sequent C, r K — i ►0 K , C and the 
F-part the sequent I j. — ► U F . In the E-part, C, P|, ; — >0 K , C is still an 
axiom. We don’t need to make any restoration; both the C’s, which were 
the vital parts at this position in the original proof, have been given to it. 
In the F-part, P,.. — ► is not an axiom, at least not by the analysis used 

in the given proof of E — ♦ F. So (disregarding the possibility of a change in 
the analysis for the formulas P F , 0,,.), we could make an axiom of 
Pi,. — > (-),.. only by bringing in twm occurrences of some formula D to 
obtain D, P,, — M-) r , D. In making an axiom this way, the sequent 
F|. — » from the F-part is not contributing anything essential. We 
might as well have discarded it, and this is what wc shall do. If there are 
only E-axioms in the given proof of E — * F, then starting in the manner 
described at (he tops of branches, and proceeding downward through the 
E-part removing some unnecessary sequents, we shall obtain an E-proof 
of E — while discarding the F-part altogether. 

Similarly, if there are only F-axioms, we shall get an F-proof of — > F 
by repairing the F-part. while abandoning the F.-part. 
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If the axioms of the given proof of E— > F include a mixture of two or 
three of the three kinds EF-axioms, E-axioms and F-axioms, then we shall 
get one of the results described for a pure set of axioms of a kind which 
is represented. 482 

In repairing the E-part and the F-part, one or both, we do the work 
step by step, corresponding to the sequents reading downward in the 
given proof in (74 or (74a of E — * F. So we generalize the conclusion we 
are aiming to establish to get a proposition applicable to each sequent 
A — » A in the given proof. Theorem 40 asserts that each sequent has this 
property. 484 

THEOREM 40. 281 Suppose given a proof in (74 or (74a of E — > F. For each 
sequent A — > A in this proof let A K , A,, ; be those of A, A respectively which 
are ancestors of the E in the endsequent E — * F; A F , A F those which are 
ancestors of the F. For each sequent A—* A in the given proof : 

Either (Cask. (EF)) in the given proof there are EF -axioms over A — » A, 
and there are a formula i (without ~) and proofs in (74a of both A K — ► A K , J 
and J, A|,> — ► A,, such that 

(1) the individual parameters of J are parameters of both A l0 — * A,, : and 
A,,. — » A|,., and 

(2) in the proof of A K — » A K , J, each atomic part of J is an image of one C 
of an a xiom , the other C of which descends into an image in one of A K , A B ; 
and similarly in the proof of J, A,,. — * A K . 

Or (Cask (E)) in the given proof there are E -axioms over A—* A, and 
there is a proof in G'4a of A K — * A K . 

Or (Cask (F)) in the given proof there are F-axioms over A— ► A, and 
there is a proof in (74a of A ,.-— * A . 

Pkooi . Starting at the tops of branches in the given proof of E — ► F, 
and working down step by step in the tree, we can establish for each 

:k ' ! By making. Cor each I. -axiom, ail "unnecessary' - restoration in the E-part and a 
restoration in the l T -part to which its contribution is unnecessary, and similarly for each 
E-axiom, we can come out with an E-proof of E — > I and an F-proof of I — > F in all 
cases. (This amounts to using the treatment of Theorem 41 (a) Cases (E) and (F) in 
G'4b at the stage of the axioms.) This procedure makes for unnecessarily complicated 
I’s, so we prefer the procedure described in the text. 

81,3 By leaving out the part of the given proof from a certain sequent A— » A down to 
E — » F, we have a theorem about a proof in (74 or (74a of any sequent A — ► A whose 
formula occurrences A, A are divided into two lists A K , A E and A K , A K . 

a *' Schutte 1%2 establishes the interpolation theorem (Theorem 41) for the intuition- 
istic predicate calculus, using a formal system without sequents resembling a Gcntzcn- 
lype subformula system. 

To adapt the present treatment to the intuitionistic predicate calculus, we can use 
Theorem 40 with the intuitionistic version of G 3 or C, modifying Case (EF) to say that 
there arc proofs either of both A,. : — > ,\ K , J and J, A K — > A,. , or of both J, A K — > A K and 
A, -> V, J. 
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sequent A — » A in its turn that it has the property ascribed to it by the 
theorem. We now describe what to do in each of the cases that can arise 
in the steps. The use of these instructions is illustrated in Example 1 1 below. 

If the given proof is in <74a rather than <74, then for deliniteness in our 
notation we shall suppose that the logical inferences in it are written as in 
<74, and all thinnings —*T, T~* (as well as contractions — >C, C— >) are 
shown separately. But in writing the E-proof or the F-proof, thinnings 
may be assimilated into succeeding logical inferences. 285 

First, we show in the next table (under Cases la-2b) how to treat 
(what remains from) an axiom C, F — ► fc), C in the E-part (left column) 
and in the F-part (right column) after the splitting operation. Absence of 
an entry in one column indicates that in that part we discard the sequent 
resulting by the split (Cases la, lb). Formulas added are underlined here 
for emphasis (but in later cases the J-notation will accomplish the same 
purpose, and in Example II the use of light- and boldface type). In Case 
2a, the J for the theorem is C, and in Case 2b it is — iC. Clearly (1) and (2) 
hold as required for Case (EF) of the theorem. 


E-proof. Axioms. F-proof. 
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of A, K — ► A iE ; an F- premise if it has provided a proof of A iK — » A, F . We 
classify the inference as an E -inference or an F -inference, according as its 
principal formula (and hence its side formula(s)) if it is logical, or its one 
or three C’s if it is structural, belong to E or to F. 

Consider the simple case (Case 3a) of an E-inference with at least one 
F-premise. Since the principal and side formulas if it is logical (or the 
C’s if it is structural) belong to E, in the F-part the premise(s) and con- 
clusion each become simply r F — >0 F . By definition of “F-premise”, 
our repair of the F-part above the inference in question gives a proof of 
r K — * 0 r . So to extend the repair of the F-part to the conclusion of the 
inference in question, we only need to cut out the F-premise in question 
and connect the tree above it to the conclusion, while discarding the other 
premise if any with everything above it not already discarded. In the 
E-part, we discard the conclusion and everything above it not already 
discarded. This is summarized in the first row of the following table. 

E-proof. Logical and structural inferences (simple cases). F-proof. 

Case 3a. E-inference with at least one F-premise. 

F k -» 0 k . 

Case 3b. F-inference with at least one E-premise. 
r E -> ©e- 


To combine the cases for the 10 propositional rules and 4 structural 
rules, when we can’t just bypass the inference as above, we write the 
conclusion as Il x , E-»0, H* where one of II 4 , II 2 is the principal formula 
(or the C) and the other is empty. We write the premises similarly, where 
each of Ej, S 2 or S lt D 2 , 2 3 , E 4 is zero, one or two side formulas (or 
zero or two C’s). For the rule 3— *, then n lt 11 2 , 2 1( L 2 , S 3 , S 4 are 
A O B, 0, 0, A, B, 0 respectively, where 0 is the empty list. The 
result(s) of our previous treatment of the premise(s) are shown at the 
tops in each of the next figures (top p. 354), and the result of treating the 
conclusion at the bottom. As the figures show, we can get from the former 
to the latter by zero, one or two inferences in C4a. Cases 4b, 5b, 6b, 7b are 
symmetric (or “dual”) to Cases 4a, 5a, 6a, 7a (as 3b, 8b are to 3a, 8a). In 
each of these cases with an EF-premise (and the inference propositional), 
(1) and (2) of Case (EF) in the theorem hold for the J in the conclusion in 
consequence of their holding in the premise(s). 

An — »V-inference is treated under the appropriate one of Cases 3a-5b. 
By the restriction on variables, b does not occur free in the T, 0. So if the 
inference is an E-inference, b does not occur free in r F — » 0 F (the 
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Propositional and structural inferences and 
E-proof, certain predicate inferences (as explained below). F-proof. 



F-part of the premise — » A x ); if an F-inference, not in r K — >0|, ; . 

Hence with an EF-premise (Cases 5a, 5b), because (1) in Case (EF) of 
the theorem is satisfied for the premise, J does not contain b free. Hence 
the new — »V satisfies the restriction on variables, and (1) is satisfied 
for the conclusion. 280 

286 We have added to our preliminary plan (second paragraph of this section) to 
secure (I); i.c. individual parameters not common to the E-part and the F-parl are 
removed from the .1 as soon as t hey cease to be common in treating V — > and — »3. 
Otherwise, we would have to remove b (if present) in treating — >V and 3 — ». 
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Now wc treat V — *. Except in the circumstances described next, this 
can be handled under one of Cases 3a-5b, with the obvious modification 
of Cases 4a-5b that the 1^, Il 2 (actually, ri 2 is empty now) appear in the 
premise also (Cases 4a'-5b'). Suppose the premise is an EF-premise, and 
the A(x) contains the x free. In the predicate calculus without functions, 
the r must be simply a variable. Say the inference is an E-inference, and 
that this variable r occurs free in F f — » 0 K but not in VxA(x), F K —* 0 E . 
Then, under (1) for Case (EF) the J for the conclusion is not allowed to 
contain r free, but the J for the premise may. Supposing it does (Case 9a), 
let us write it “J(r)”; and let y be a variable (maybe x) free for r in J(r) 
and not occurring free in J(r) (unless y is r). 

V— > with r a variable free in 
the J for the premise but not in both 
E-proof. the E-part and the F-part of the conclusion. F-proof. 


Case 9a. E-inference with EF-premise. 

A(r), VxA(x), r K -»0 E , J(r) J(r), lY-K-V 

VxA(x), r K -» 0 K , J(r) VyJ(y), -» 0 K . 

VxA(x), r E -»0 E , Vyj(y). 

Case 9b. F-inference with EF-premise. 

r K -»e K ,J(r) _ 3 J(r), A(r), VxA(x), r y -»0 F 

r* -* e K , 3yj(y). J(r), VxA(x), r K ->0 K 

3yJ(y)> VxA(x), r K -> 0 K . 


In Case 9a, because r does not occur free in VxA(x), F E — * 0 K , the 
— »V is legitimate. Similarly, in Case 9b (with r free in A(r) and r K — > 0 K 
and J(r) but not in VxA(x), F,. — > 0 F ) the 3— ► is legitimate. In the 
predicate calculus with functions (including individuals), more generally 
the r may contain free variables c 1( . . . , c,„ and individual symbols 
e,„ti, • • • . e„ which are parameters of the J for the premise, but are not 
common to the E-part and the F-part of the conclusion (Cases 9a', 9b'). 
Write r also as “r(c l( . . . , c m , e m+1 , . . . , e„)”, and the J for the premise as 
“J(c„ . . . , c,„, e m+ i, . . . , e„)”. For Case 9a' (VxA(x), I\ ; -» 0 K not 
containing as parameters c„ . . . , c m , e wf „ . . . , e,), in the E-proof as 
constructed down through the premise of the inference in question, we 
can change e m+l , . . . , e„ throughout to respective distinct variables 
c,„. h . . . , c„ not previously occurring in that proof, to obtain instead a 
proof of A(r(c„ . . . , cj), VxA(x), r K — ► 0 E , J(cj, .... c„). Now, 
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instead of inferring Vy.!(y) as principal formula from J(r) as side formula 
by one inference in each proof (Case 9a), we can infer 
Vy, . . Vy„J(\,, . . . , y„) from J(c!, . . . , c„) in the E-proof and from 
J(c,, . . . , c,„, e,„ e„) in the F-proof by n inferences of the same 

respective kind. 

The rules 3 — *■ and — >3 are treated similarly, partly under earlier cases, 
and partly as new Cases 10a, 10b, 10a', 10b'. 

Summarizing, we repair the E-part or the F-part or both to obtain an 
E-proof or an F-proof or both, by proceeding downward through the 
given proof step by step, applying the appropriate case at each step. 
This leads effectively to one of the three results described in Cases (EF), 
(E) and (F) of the theorem, determined by the given proof and the given 
analysis of it. This procedure may involve some unnecessary work in 
repairing the upper parts of branches that will later be discarded in 
treating two-premise inferences under Cases 3a, 3b. If we first classify 
the axioms as EF-, E- and F-, and the two-premise inferences as E- and F-, 
we can look ahead and anticipate which branches will be discarded. 

Exampi ill. First, we show a given proof of E — > F in G4a, with the 
E-part in iightfacc, the F-part in boldface. To each numbered sequent 
A— » A in this proof, the like-numbered sequents in the E-proof and the 
F-proof (below it) are the ones reached by repairing the E-part and the 
F-part as far as the corresponding sequents A K — » A K and A,.. — » A K . 287 


EF-axiom 

1. P(a), S , R -> 3xP(x), P(a) 

2. P(a), S, R — > 3xP(x) ^ 

3 . P(a), S — > R 3 3xP(x) ^ 

4, S & P(a) ->R3 3xP(x) & 

5. S & P(a) ->R3 3xP(x), S & Q(b) 

F-axiom EF-axiom 

6. S, Q(b), P(a) — > S 7. Q(b), S, P(a) -> Q(b) > 

8. S, Q(b), P(a) -» S & Q(b) ? 

5. 9. Q(b), S & P(a) -» S & Q(b) 

10. ( R 3 3xP(x)) 3 Q(b), S & P(a) -» S & Q(b) 

11 . (R 3 3xP(x)) 3 Q(b) — » S & P(a)DS& Q(b)~* 3 

1 2. (R D 3xP( x)) 3 Q(b) -» Vx(S & P(x) D S & Q(b)). 


237 In this example, no repaired sequent is later discarded under Case 3a or 3b for a 
two-premise inference (at most duplications are suppressed), nor altered by changing 
e,„.i, . . . , e„ to c,„ (1 , . . . , c„ under Case 9a', 9b', 10a' or 10b'. 


1 ♦ • 
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The E-proof, with formulas replacing the F-part in boldface. 

P(a), R — » 3xP(x), P(a) ^ 

1. R — > 3xP(x), P(a), -iP(a) ^ 

R — > 3xP(x), -iP(a) 

2. R -» 3xP(x), Vx-iP(x) ^ 

3, 4, 5. — » R 3 3xP(x), Vx-iP(x) 7, 8, 9. Q(b) -» Q(b) 

(R 3 3xP(x)) 3 Q(b) -» Vx-iP(x), Q(b) ~ 

10, 1 1 , 1 2. (R 3 3xP(x)) 3 Q(b) Vx-iP(x) V Q(b). 

The F-proof, with formulas replacing the E-part in light face. 
P (a), S -> P(a) 

1 . ~tP(a), P(a), S — > v _ 

2, 3. Vx-iP(x), P(a), S-* 

4. Vx~iP(x), S & P(a) — » ^ 

5. Vx-iP(x), S & P(a) -> S & Q(b) 


6. S, P(a) -> S 7. Q(b), S, P(a) -» Q (b) 
8 - Q(b), S, P(a) — > S & Q(b) 7^ 

5. 9. Q(b), S & P(a) ^S& Q(b) 

10. Vx-iP(x) V Q(b), S & P(a) -> S & Q(b) ^ 

1 1 , Vx-iP(x) V Q(b) -> S & P(a) 3 S & Q(b) 

12. Vx-tP(x) V Q(b) -> Vx(S & P(x) 3 S & Q(b)). 


Theorem 41. 288 (Craig’s interpolation theorem 1957, 1957a.) Suppose 

2,8 Craig 1957, 1957a says that (b) (without the parenthetical version of the hypothesis) 
was first called to his attention by P. C. Gilmore. Using the Gentzen-type system G, 
(b) (with the parenthetical version, and essentially the present proof) is in Kleene 1952 
in the form of Lemma 6 with Lemmas 3 and 4; and that combination of lemmas for 
both the classical and the intuitionistic cases is used repeatedly in Kleene 1952a (pp. 
49, 50, 51, 53, 54). 

We need the hypothesis " E and F contain a common predicate parameter ” in (a) and 
“ neither I — lE nor f F” in (c), because our formation rules §§ 16, 28 give us no way to 
build a formula without its containing some predicate parameter. E.g. if E 3 F is 
(P 3 P) 3 (Q 3 Q), we can’t find an 1 for (a) containing only common parameters; 
if E 3 F is -IP 3 (Q 3 Q) V P, we can satisfy (a) by using -IP for I, but (c) would fail. 

If we adopt formation rules which provide a proposition or predicate constant (like 
t, 1 or =) that does not count as a parameter, then we can dispense with the quoted 
hypotheses. Then e.g. t or ~ if or Vx x = x 3 Vx x = x could be the I for both 
(P 3 P) 3 (Q 3 Q) and "lP 3 (Q 3 Q) V P. 

Lyndon 1959 and Henkin 1963 use such rules (with propositional constants for truth 
and falsity). Lyndon 1959 (although he says he found his result using a Gentzen-type 
system) gives a model-theoretic treatment, in which his “main theorem” is incorrect 
(as shown by Taitslin 1960) but can be mended (according to Henkin 1963). 
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that m the predicate calculus without equality E E D F. Then: 

(a) If H and F contain a common predicate parameter, then there is a 
formula I such that E E D I and H 3 F, and the individual and predicate 
parameters of 1 are common to E and F. 

(h) I K leone 1 052. ) If E and F contain no common predicate parameter 
[or for I.DI without — even if no predicate parameter occurs positively 
in both or negatively in both ), 270 then either 1 — lE or KF. 

(c) (Lyndon 1 95'D.)- 70 For ED F without ~, if neither I — lE nor E F, 
then there is a formula I (without ~) such that E E D I and E I 3 F, and 
the individual parameters of I are common to E and F, and a predicate pa- 
rameter occurs | L ^ J in [ only if it occurs I positive/}} jn h0(hEanilF 

{negatively) J {negatively) 


Prook. We may suppose that EDF contains no variable both free 
and bound. 2:,i> Assume the hypothesis that EDF. Then by Gentzen's 
theorem, E, ;4> — * E D F, whence (by one — O-step upward) E W)il E — ► F. 
We apply Theorem 40 to a given proof of E — * F with the endsequent 
E — » F as the A — » A. 

(a) Now assume that E and F have a common predicate parameter. If 
Case (EF) of Theorem 40 holds, then E E — » 1 and *E 1 — > F where 1 is 
the J for the bottom sequent as the A — * A. By (1), I contains only indi- 
vidual parameters that are common to E and F. By (2), (he predicate pa- 
rameter of each atomic part of I is in each of E and F, by descent from the 
C of an axiom. If Case (F.) or Case (F) holds instead, we begin by picking 
a predicate parameter K which is common to E and F, which we can do 
by hypothesis. Let D be VxK(x x) (or simply K if K takes 0 argu- 

ments). T he following table indicates how in these cases we get F E — ^ 1 
and l I — »F. in 6'4b now. 


When E E — *, take I to be i(D D D). 

hfl > T 

E -> — i(D D D). 


D- 


F, D 


-» F, D D D 
-i(DD D)-» E. 


When 1 — * F, take I to be D D D. 

D. El — * D 


HDD ID. 


DDD->F. 


T-+ 


Again, I ha-* onl\ allowed parameters. Now in any case, using — O and 
Gent/en's theorem lor Theorem 36 Corollary (a)), E E D 1 and E 1 D F. 
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(b) Suppose, for E D F without that no predicate parameter occurs 
positively in both E and F or negatively in both. We shall infer that there 
is no EF-axiom. It will follow that Case (EF) of Theorem 40 cannot apply; 
and Cases (E) and (F) lead respectively to f mE and I- F. So assume, for 
reductio ad absurdum, that there is an EF-axiom C, 1' — » 0, C in the 
given proof of E— >F. Using Lemma 13 §54, say that the Negative 
(antecedent) C in this axiom descends into an image in the E of E — * F, 
and the Positive C into one in the F. Then by Lemma 14 the images are 
Negative and Positive, respectively; so as parts of E and F separately, 
they are both positive, giving rise to positive occurrences of the predicate 
parameter of C in both E and F. Similarly, if the Positive C descends into 
E and the Negative into F, we get negative occurrences of the parameter 
in both E and F. 

(c) Suppose E 3 F does not contain ~, and that neither V n — lE nor 
V u F. Then by Gentzen’s theorem (and one use of — >— i), neither b W4a E — > 
nor h r ; 4;l — > F. So it has to be Case (EF) of Theorem 40 which applies. 
Consider any predicate parameter occurrence in I; say Q is the atomic 
part which contains it. Say C t is positive as a part of 1, and hence Positive 
as a part of E— »l and Negative as a part of I—* F. Applying (2) of 
Case (EF) with Lemma 14, in the proof of E— >1 the part C, descends 
from the second (Positive) C of an axiom C, P K — * 0 K , C, the first (Nega- 
tive) C of which descends into an image C K in the E of E — ► I. This image 
is Negative as a part of E —> I but positive as a part of E. Similarly from 
the proof of I — * F we are led to an image C K of C which is positive as a 
part of F. Thus the parameter (assumed to occur positively in 1) occurs 
positively in both E and F. Similarly, a predicate parameter occurring 
negatively in 1 occurs negatively in both E and F. — 

In the predicate calculus with equality (with or without functions), the 
statement of Craig’s interpolation theorem can be simplified, since the 
symbol = (which does not count there as a parameter) is available for 
building formulas. 

Thkokum 42. (Craig’s interpolation theorem with equality 1957a.) 
In the predicate calculus with equality, if b E Z> F, then there is a formula 1 
such that b E 3 1 and H D F and the parameters of I are common to E 
and F. 

Proof. The method of the proof of Theorem 41 via Theorem 40 does 
not lend itself directly to excluding from 1 function parameters (with > 0 
places) not common to E and F. However with equality available, we can 
first use the idea which we met in § 38 of paraphrasing statements employ- 
ing functions to employ predicates instead. 

Suppose that b E 3 H in the predicate calculus with equality. Let 
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A„ . . . , A,„ he the open equality axioms for = and the other predicate 
and function symbols of E 3 F. By Theorem 31 §29, then F E 3 F in the 
system consisting of the predicate calculus H with only these predicate 
and function symbols and with A„ . . . , A,„ as additional axioms. Say the 
(individual and) function symbols in E 13 F are f t , . . . , f, (k > 0). Call the 
system just described S k . As illustrated in § 45 ((3), we can replace f, c , . . . , f , 
as symbols for functions (whatever functions the symbols express in a given 
model) successively by F A , . . . , F, as symbols for the representing predi- 
cates of those functions. Thereby, we are led to a system S 0 in which the 
result E' 3 F' of paraphrasing E 3 F to use the representing-predicate 
symbols in place of the function symbols is provable. The formula E' con- 
tainsthesame free variables as E, no (individual and) function symbols, 
and (as predicate parameters) exactly the predicate parameters of E and 
the representing-predicate symbols replacing (individual and) function 
sy mbols of E. The parameters of F' are similarly related to those of F. The 
nonlogical axioms B x , . . . , B, of S 0 are the open equality axioms for = and 
for the predicate symbols of E' 3 F', and the formulas 3 !wF,(x,, . . . , x„ , w) 
for each of the representing-predicate symbols F,, . . . , F*. Thus each of 
B,, . . . , B; contains at most one predicate parameter, and that parameter 
occurs in E' 3 F'. Suppose the list B X , . . . , B, chosen so that in the first 
part of it Bj, . . . , B A . (possibly empty) only predicate parameters in E' 
occur, and in the second part B t .,„ . . . , B, (possibly empty) only ones 
in F'. (Predicate parameters in both E' and F' can occur in either part.) 
Since F E' 3 F' in S„, by V-eliminations VB X , . . . , VB, F E' 3 F' in the 
predicate calculus //. Thence by the deduction theorem etc., 

F VBi & . . . & VB* & E' 3 (VB,. +1 & . . . & VB, 3 F') in H. 

Now we reason as for Theorem 41 (a), using the Gentzen-type system 
G4a with only the symbols of S 0 (and therefore no function symbols). 265 
If Case (E) or (F) of Theorem 40 applies, we employ x=x in the role of 
the K(x, . . . , x) there. We obtain thus a formula I' such that in H 

F VB, & . . . & VB* & E' 3 I' and F I' 3 (VB t41 & ... & VB { 3 F') 

and I' contains only individual and predicate parameters common to 
VB X & . . . VB*. & E' and VB a . + j & ... & VB, 3 F', and no function param- 
eters. These common parameters are free variables, which clearly must 
be common to E' and F', and predicate parameters, also common to E' 
and F' by the way we split the list B t , . , . , B,. 

Now VB„ ...,VB t FE'3I' and VB fcll , . . . , VB, F I' 3 F' in H. So 
both E' 3 I' and I' 3 F' are deducible in H from VB X , . . . , VB„ and hence 
are provable in S„. By unparaphrasing, we obtain a formula L such that 
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E 3 I and I 3 F are both provable in S*, and hence in the predicate cal- 
culus with equality. That we can thus get back to the original A lt ...» A m , 
E and F is given by the theorem on paraphrasing. 289 The free variables of 
I are the same as of T and hence are common to E', F', E, F ; and the other 
parameters of I are those predicate parameters of I' which are common 
predicate parameters of E', F', E, F, and the (individual and) function 
symbols of E and F which in the passage from S* to S 0 were replaced by 
representing-predicate symbols common to E', F'. Thus all the parameters 
of I are common to E and F. 

Exercises. 56.1. Treat in the manner of Example 1 1 : 

(a) The proof in (74 in Example 7 § 54. 

(b) The proof of (P(b) 3 S) & P(b) -» VxQ(x) 3 R V Q(b) obtained by 

using in order from the bottom up — *3, 3— > and (in one branch) 

V—* — »V. 

56.2. Show that in the predicate calculus (for F not containing ~): If 
F F, then some predicate parameter occurs in F both positively and 
negatively. 

§ 57. Beth’s theorem on definability, Robinson’s consistency 
theorem. The fifth postulate of Euclid is independent of the other postu- 
lates, i.e. it cannot be deduced by logic from them. This was demonstrated, 
after more than two millenia of speculation, by giving an interpretation 
which makes the other postulates all true and the fifth postulate false. 
The Cayley-Klein model for non-Euclidean geometry 1871 (and for the 
geometry of a bounded part of the plane, Beltrami’s model 1868) is such 
an interpretation (§ 36). Since then, this has become the standard method 
in formal axiomatics of showing the independence of one of a list of axioms 
from the others, or more generally of a given statement from a given list 
of axioms. 

To put this method in our terms, take the case of an axiomatic theory 
formalizable in the (first-order) predicate calculus without or with equality, 
say with X 0 axioms. Say the axioms are expressed by formulas A„, A x , 
A 2i . . . (with their free variables if any having the generality interpretation) 
and the statement to be proved independent of them by F. Let E 0 , E lt 
Eg, . . . be the closures of A 0 , A x , A !( . . .. The traditional method then 
simply applies the consistency theorem (cf. end § 52), 

(E 0 , E x , E„ . . . h F} -*• {E 0 , E x , E 2 , . . . 1= F}, 
with informal use of *12a (contraposition), *82b and *55c (§§ 3, 25). 

IM Theorem 43 p. 417. The present E' and F are the results of applying to our E 
and F the ' of IM p. 41 1 successively with respect to each off,, ... , f,, and I is the result 
of applying to our I' the 0 of IM p. 417 successively with respect to each of Fi, . . . , F,. 
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Similarly, the completeness theorem, 

{E 0 , Ej, E 2 , . . . t= F} {E 0 , E x , E 2 , ... b F}, 

wlien contraposed, says that in principle the traditional method must 
always work: i.e., whenever the statement in question does not follow by 
logic from the axioms, it has to be false under an interpretation which 
makes all the axioms true (cf. § 53). 

Just as we usually wish to have the axioms of a formal theory independ- 
ent, we would also like to have the primitive concepts independent, i.e. 
such that no one of them can be defined from the others. How can it be 

shown that, in the theory based on certain axioms A 0 , A t , A 8 a 

certain concept q cannot be defined from the other concepts p 0 , p u p 2 , . . . 
of the theory. Padoa 1900 used a counterpart of the more familiar method 
of showing axioms to be independent. Namely, he gave two interpretations 
(with the same domain) such that the axioms A„, A 1( A 2 , . . . are all true 
under both, p 0 , p x , p 2 , . . . have the same values under both, but q has 
different values under the two. For, if there were a definition of q from 
p„, Po Pa, in the theory with A 0 , Aj, A 2 , ... as the axioms, that defini- 
tion should determine the value of q from the values of p 0 , p„ p 2 , . . . for 
any values of the latter which are compatible with A„, A,, A g , ... all being 
true. Thus it should not be possible to make q have different values when 
p 0 , Pi- p* 2 ’ • • • have the same values and A 0 , A 1( A 2 , . . . are all true. 

We now study this method in modern terms. We generalize it to discuss 
the definability of q from p„, p„ p 2 , . . . when the closures E 0 , E 1; E 2 , . . . 
of the axioms A„, A„ A 2 , . . .may contain still other parameters r 0 , r x ,r 2 ,. . .. 
Our notation is for the most general case, of N 0 axioms and N 0 parameters 
in each of the two lists. The reader can supply the modifications when one 
or more of these lists are finite or even empty. We take q and all the listed 
parameters to be distinct. For the present, each parameter in the two lists 
(which together with q must include all occurring in the closed axioms 
E 0 , Ej, E,, . . .) shall be a (proposition or) predicate symbol or an individual 
symbol, q shall be an n-place predicate symbol Q (with n > 0), and the 
logic shall be the predicate calculus without equality. 

Padoa’s method is equivalent (by contraposition etc.) to the implication: 
(Q is definable from p 0 , p 1( p 2 , ■ . . in the theory based on E 0 , Ej, E 2 , . . . as 
axioms} -* (each two interpretations of the parameters Q, p 0 , p x , p 2 , . . . , 
r 0 , rj, r 2 , . . . which make E 0 , E x , E 2 , . . . all true and give p 0 , p x , p 2 , . . . the 
same values give Q the same value}, or briefly Dfb E — ► Dfd f . 

First we analyze Dfb K . We take it to mean that a suitable defining 
expression can be given for Q(x x , . . . , x„) (as definiendum). Such an expres- 
sion shall be in the language of the theory, i.e. the predicate calculus with 
the named parameters. It shall contain as free variables only x x , . . . , x,„ 
and as other parameters only ones from the list p 0 , p 1? p 2 , .... of which it 
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can contain only linitely many, say only p 0 , . . . , p,.. Say the defining 
expression or definiens is R(x lt . . . , x,„ p 0 , . . . , p s )- While the logic is the 
predicate calculus, we must have s 0 with at least one predicate param- 
eter among p w> . . . , p,, as a defmiens cannot be constructed without 
using a predicate parameter. Finally, we must express that R(x 1; .... x„, 
p„, . . . , p„) is a deliniens for Q(x,, . . . , x„) in the theory based on E 0 , E b 
E 2 , . . .. This means that, for each domain D and assignment which satisfy 
all of E,„ E,, E.,, . . . , the formula R(x,, . . . , x„, p„, . . . , p s ) shall have the 
same truth value as Q(x b .... x„), for all values in D of x„ . . . , x„. Thus 
we are led to the following rendering of Dfb K . 290 

Dfbjf: For some formula R(x lt . . . , x„, p 0 , . . . , p s ) containing only the 
parameters shown, 

E 0 , E x , E a , . . . != Q(x lt . . . , x n ) ~ R(x b .... x,„ p„, ... , p„). 

We say in this case that ( moilel-tlieoretically ) Q is definable explicitly from 
p 0 , p b p 2 , . . . in the theory based on E 0 , E b E 2 , . . . as the closed axioms, or 
that E 0 , E b E 2 , . . . make Q so definable. Further, for such an R(x b . . . , x„, 
p 0 , . . . , p„), we stiy that (model-theoretically) E 0 , E x , E 2 , . . . define Q explic- 
itly from p,„ . . . , p., as R(Xj x„, p„, . . . , p,). 

We can replace “E 0 , E x , E 2 , . . . b” equivalently by “for some d, E 0 , . . . , 
E <( b” [by the completeness and consistency of the predicate calculus], and 
further by “for some d, b E 0 & . . . & E d 3” [by D-introd. etc.]. Only 
finitely many parameters can occur in E 0 & . . . & E d ; we can increase the 

j in R(x b . . . , x,„ p„ p s ) if necessary so only p 0 p s , r 0 , . . . , r, 

occur in E 0 & ... & E rf . By these steps, Dfb” is equivalent to the following. 


Dfbj,“: For some finite conjunction E(Q, p 0 , . . . , p», r„, . . . , r ( ) of 

E,„ Ej, E 2 , . . . and some formula R(x b . . . , x„, p„ p s ), each 

containing only the parameters shown, 

I- E(Q, p 0 , . . . , p s , r 0 , . . . , r,) D 

[Q(x x , . . . , x„) ~ R(x x , . . . , x a , p 0 p s )]. 


We say in this case that ( proof-theoretically ) Q is definable explicitly from 

(Pc P 1 ’ P"’ ' ' ') in the theory based on (^"’ ^” ’ J, or 

(p 0 , ■ • ■ , P, I (E(Q, p 0 , .... p s , r 0 , ... , r ( )J 

that (w E p. E! v.7p”!, e Q so Fur,h ' r ' 

II, comically) „ f'"’ ,, 0 explicitly from 


E„, E x , E 2 , . . . define 

E(Q, p 0 , ■ • • , p s , r 0 , . . . , r,) defines 


Q explicitly from 


Since E„, E,, E 2 , . . . are closed, 

“E„, E,. If. 1= Q(x,, x„) — R(x„ . . . , x,„ p„ p,)” is equivalent to 

"E,„ E„ E., h Vx, . . . Vx„[Q(x, x„) ~ R(x, x„, p„ p,)l”. and simi- 

larly with “t-” in place or It is convenient to use the shorter expressions with the 
free x,, . . . , x„. 
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p 0 p, as R(x,, . . . , x„, p,„ . . . , pj. The lower version here can be 

used with any formula E(Q, p,„ .... p„ r 0 , . . . , r,) with only the param- 
eters shown, whether or not it is a finite conjunction of E 0 , E,, E 2 , . . 

Next we analyze Dfd,. Let Q\ r', r,', r), . . . be new parameters of the 
same respective hinds (predicate or individual) and numbers of argument 
places as Q, r„, r„ r.>, . . . ; and let E^, EJ, E.j, . . . come from E O' E,, E.,, . . . 
by substituting Q\ r„, rj, r.j, . . . for Q, r„, r„ r.>, .... respectively. I nstcud 
of having two interpretations of Q, p 0 , p t , p 2 , . . . , r 0 , r,, r 2 , . . . which make 
E„, E,. I ... . . . all true and give p„, p,. p a , . . . the same values, it is equiva- 
lent to have one interpretation of Q, Q\ p„. p,, p 2 r„, r„, r,, rj, r 2 , r 2 , . . . 

which make E„. H,j, E,. Ej, E 3 , E.'„ . . . all true. This device leads to the 
following rendering of Dfd,. 

Dfd' 1 : E,„ E', E„ E;, E,, Ej 1= Q(x, x„) ~ Q'(x„ . . . , x„). 

We say in this ease that {mode/- theoretically) Q is defined implicitly from 
P«' Pi- IT. • • • (with r 0 - r„ r.>, . . . as auxiliary parameters) by, or in the theory 
based on, E„, E,, E 2 , . . .. 

By the same transformations as we used on DfbJ.|, this condition Dfd' 1 
is equivalent to the following, where E(Q\ p„, . . . , p,., r,j, . . . , rj) is the 

result of substituting Q', r' rj for Q, in 

E(Q. p„ r„ r,). 

Dfd!’: for some finite conjunction E(Q, p 0 , . . . , p,, r 0 , . . . , r ( ) of E 0 , E,, 
E_„ . . . containing only the parameters shown, 

f 1 .( Q, p () , ... , p,, r„ i ,) & E(Q , p„, , p,, r () , ...» r,) 3 

[Q(x, x„)— Q'(x x„)]. 


We sav in this ease that (proof theoretically) Q is defined implicitly from 

||V Po P-u - I ( u///, ( r< ” r *’ r -’ • * j as auxiliary parameters) by, or ill the 
I Pm • ■ • . P.s j lr 0 r, ) 


. , , |E U , E„ E 2 , . . . \ 

theory based on, . 

(E(Q, p,„ .... p„ r 0 , r,)j 

Summarizing, Padoa's method rests on an implication which (after con- 
traposition) we have rendered in model theory by Dfbj! -*-Dfd} 1 , or 
equivalently (using Dfb'. = Dfbjj and Dfd} 1 = Dfd|’) in proof theory by 


Dfbj; - Did]'. 


It is a simple exercise to corroborate Padoa’s method proof-thcorctically 
by establishing Dfbj.j - >- Dfd j’. In fact, any formula 

E.(Q. p,„ . . . , p„ r r,) which makes Q definable explicitly from 


p„ p, also defines Q implicitly from p,„ . . . , p, (Exercise 57.1). 


In like manner we could inlroducc H(Q, p p„ r,„ . . . , r,) into the model- 

theoretic formulation Dfb)' (also into Dfd) 1 below). If there are only finitely many 
axioms to begin with, E(Q, p p,, r r ( ) can be simply their conjunction. 
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Our analysis thus far has been a straightforward application of results 
which have been available since 1930 (consistency with respect to validity 
e g. Hilbert and Ackermann 1928 bottom p. 73, the deduction theorem 
1930, Godcl’s completeness theorem 1930). 

Now we inquire whether Padoa’s method is complete. That is, will it 
always work, whenever one primitive concept in an axiomatic theory is in 
fact independent of the others? An affirmative answer would be equivalent 
via contraposition to saying that, whenever it is impossible to satisfy the 
closed axioms E 0 , E„ E 2 , ... by two values of Q compatibly with given 
values of p 0 , p,, p 2 , . . . , the resulting implicit model-theoretic dependence 
of Q on p„, p„ p 2 , . . . must be expressible by an explicit definition within 
the syntactical limitations of the language. This is certainly not obvious. 

However, the completeness of Padoa’s method was established with a 
trivial exception by Beth in 1953. 2i>2 The exception is when p 0 , . . . , p, 
include no predicate parameter; without a predicate parameter, we can’t 
build a definiens R(x„ . . . , x„, p 0 , . . . , p„) in the predicate calculus with- 
out equality. The provision for auxiliary parameters r„, . . . , r, in the 
implicit definition not appearing in the definiens, and some other general- 
izations of Beth’s result, were given by Craig in 1957a. 

By the foregoing preliminary analysis, what we need is DfdJ 1 — ► DfbJ!, 
or equivalently Dfd|‘— >• Dfb};. For the latter, it will suffice to show that 
any formula E(Q, p„, p s , r 0 , ..., r,) which (proof-theoretically) 
defines Q implicitly from p„, . . . , p # also makes Q definable explicitly 
from p„, . . . , p,, as we state in: 

Theorcm 43. (Beth’s theorem on definability 1953.) Let the symbolism 
and logic be those of the predicate calculus w ithout equality but allowing 
individuals. Let the notation be as explained above (in particular, each 
formula shall contain only the distinct parameters shown). Suppose that 

H E(Q, p 0 , . . . , p s , Tg, . . . , r ( ) & E(Q , pg, . . . , p s , r u ,. . . , r ( ) 3 

[Q(xj, . . . , x„) ~ Q'(Xj, . . . , x n )], 

/.<?. E(Q, p 0 , . . . , p,, r 0 , . . . , r ( ) defines Q implicitly from p„, . . . , p, with 
r„, . . . , r ( as auxiliary parameters. Then: 

(a) If one of p 0 , . . . , p, is a predicate parameter, there is a formula 
R(xi x,„ p u , . . . , p s ) such that 

h E(Q, p,„ . . . , p,., r 0 , . . . , r,) =3 

[Q(Xi x„) ~ R(x u . . . , x„, p 0 , . . . , p,)], 

i.e. E(Q, p 0 , . . . , p„ r„, . . . , r ( ) makes Q definable explicitly from p 0 , . . . , p,,. 

-“ i Tarski IV 34 dealt with the essentially simpler case of theories based on higher- 
order predicate calculus (type theory). 

The present proof (of Theorem 43), using Craig's interpolation theorem, is essentially 
the same as in Craig 1957a. 
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(b) If none of p„, . . . , p s is a predicate parameter occurring in 
E(Q, p, p,, r„ r,) ( then 

either E E(Q, p 0 , . . . , p„ r 0 , . . . , r ( ) 3 Q(xj, . . . , x„) 
or h E(Q, p 0 , r,) =3 mQ( Xl , . . . , x„), 

i.e. E(Q, p„, . . . , p s , r 0 , . . . , r ( ) defines Q either as the constant predicate 
"truth” or as the constant predicate "falsity". 

Prooi. Using the main hypothesis with the propositional calculus, 

(i) h E(Q, p p,, r r,) & Q(x„ . . . , x„) 3 

[E(Q\ p 0 , p s , r,', r‘ t ) 3 Q'^, . . . , x„)]. 

This we take as the ‘TE3 F” for Craig’s interpolation theorem (Theorem 
41). 

(a,) Suppose one of p„, . . . , p s is a predicate parameter occurring in 
E(Q, p„, . . . , p s , r„, . . . , r ( ). Then by (a) of Theorem 41, there is a form- 
ula 1 such that I E 3 1 and HDF and 1 contains only parameters com- 
mon to E and F. But at most x„ . . . , x,„ p 0 , . . . , p, are common to E 
and F. So letting R(Xj, . . . , x,„ p„, . . . , p„) be this I, the structural require- 
ment is met, and 

(ii) b E(Q, p 0 , • • • > p s , r o> - • • > r r) & Q( x i> ■ • • > x «) 

R( x l> • • • , x n> P(» • • • > p,), 

(iii) h R( x i, • • • , x „, Po- ■ • • . P,) 

[E(Q , Po, . • • , p$» r 0 , . . . , r,) 3 Q (xj, . . . , x„)j. 

The proof of the formula in (iii) remains a proof on substituting in it 
Q, r u r, for Q', r', . . . , rj. 86 So 

(iv) H R(x„ . . . , x,„ p 0 , . . . , p,) 3 

[E(Q, po, . . . , p,, L>, , r t ) 3 Q(xj, . . . , x ,,)]. 

From (ii) and (iv) by propositional calculus, 

(v) I- E(Q, p„, . . . , p s , r 0 , . . . , r,) 3 

[Q( x i, • • • , x„)~ R(x t , .... x,„ po, • . . , p,)]. 

(a 2 ) If one of p 0 , . . . , p s is a predicate parameter, but no predicate 
parameter among p 0 , . . . , p s occurs in E(Q, p 0 , . . • , p s , r 0 , . . . , r ( ), then 
(b) (to be established next) applies, after which we can trivially construct 
an R(x, x„, p 0 , ■ . • , p,)- 

(b) Suppose no predicate parameter among p 0 , . . . , p s occurs in 
E(Q, p 0 , , p„ r 0 , . . . , r,). Then by Theorem 41 (b), 

(vi-1 ) either h ~i[E(Q, p„ p„ r,„ . . . , r ( ) & Q(x„ . . . , x,,)] 

(vi-2) or I- E(Q', p 0 p», i\, r') 3 Q'(x t , .... x J. 
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Using propositional calculus (*60, *49; or IM *58b) on (vi-1), and substi- 
tuting into (vi-2), 

(vii-1) either F E(Q, p 0 , . . . , p„ r 0 , . . . , r,) 3 -iQ(Xj, . . . , x„) 

(vii-2) or h E(Q, p 0 , . . . , p s , r 0 , . . . , r ( ) 3 Q(x 1( .... x„). — 

Now we take the case of the predicate calculus with equality and func- 
tions. The parameter q for Padoa’s method can now be either an n-place 
predicate symbol Q or an n-place function symbol g (with n > 0). 

Theorem 44. (Beth’s theorem on definability, in the predicate calculus 
with equality and functions.) In the predicate calculus with equality and 
functions , and with the notation as explained: 

(A) // 

I- E(Q, p () , .... p s , r 0 , ... , r ( ) & E(Q', p 0 p s , r', . . . , r') 3 

[Q(Xi, • • • , x„) ~ Q'(x„ .... x J], 
then there is a formula R(x x> . . . , x„, p 0 , . . . , p & .) such that 

I - E(Q, ) p<) • • • > ^t) 

[Q( x i. • • • , xj ~ R(x x X ni p 0 p,)]. 

(B) If 

F E(g, p 0 , . . . , p s , r 0 , . . . , r,) & E(s > Po> ■ • * » P»» t"o» • • • > t ( ) 3 

g(Xl X„_,) = g'(x 1 X H _j), 

then there is a formula R(xj, . . . , x„, p 0 , ... , p s ) such that 

F E(g, Pq, . . . , p 5 , Tq, * • • , r ( ) 3 

[g(Xi, . . . , x„_ 1 )=x„ ~ R(x„ .... x„, p 0 , , p s )J. 

Proof, (a) As before, except using Theorem 42, which gives us an 

R(x 1; . . . , x„, p 0 p s ) in all cases. 

(b) In the predicate calculus with equality, 
g(x!, . . . , x„_,)=g'(x 1 , . . . , X rt — j) is equivalent to 

g(x„ . . . , x„. 1 )=x„~g'(x 1 , . . . , x„_ 1 )=x„. Now the above reasoning 
applies with g(x„ . . . , x„_,)=x„ in place of Q(x,, .... x„). — 

As we remarked in § 38, 157 in N with the symbol • and its axioms 20 
and 21 omitted, the representing predicate a-h=c of a-b cannot be 
expressed. Hence by Theorem 44 (B) (contraposed), 251 Padoa’s method 
applies; i.e. there are two models of N, with the same domain D and the 
same assignment to 0, ', -E (with = meaning equality), in which • has 
different functions as values. — 

As a second application of Craig’s interpolation theorem, we establish 
the consistency theorem of A. Robinson 1956, 1963 p. 114. Robinson 
proved this independently of Craig’s theorem, and used it to prove the 
theorem of Beth. 
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I'he problem is this. Suppose we have two consistent formal systems 

and S 2 , with axioms A„, A t , A 2 , . . . and B„, B 2 , . . . and nonvariable 

parameters p„, p lt p 2 , . . . and q 0 , q t , q 2 , . . . , respectively. (The reader may 
substitute finite lists.) Will we obtain a consistent system U S 2 by 
combining S t and S., so the axioms of S x U S 2 are A 0 , B u , A t , B lt A 2 , B 2 , . . . 
and the nonvariable parameters are p 0 , q 0 , pi, q 2 , p 2 , q 2 , . . . (each list with 
repetitions allowed)? 

Not necessarily. For, let S, be number theory N with the addition of 
Godel’s formula - )(?„ (which is true in the standard model, but unprovable 
in N), and S 2 be N with C„ added. The systems S, and S 2 are each con- 
sistent, using the consistency of N (cf. §47, where'S 2 is called “M”). But 
the union S t u S 2 is not. Clearly the difficulty here is that the common 
portion N of Si and S 2 is incomplete, so that in S 2 and in S 2 we could 
extend N in two different ways, each consistent with N, but inconsistent 
with each other. 

This example suggests an additional hypothesis under which we can hope 
to answer the question affirmatively: the two formal systems should be in 
“complete agreement” on topics of mutual concern; i.e., for each closed 
formula I which contains only parameters common to Si and S,, the same 
one of the two formulas l and “il should be provable in each of S x and S 2 . 

The reasoning here, as in Godel’s completeness theorem and most other 
passages in this chapter dealing with N 0 formulas, does not depend on the 
formulas being given effectively, as the axioms of a formal system are 
required to be (cf. §§ 37, 43). So in stating the theorem, we allow S t and 
S 2 alternatively to be sets of formulas based on the predicate calculus, 
analogously to (the set of provable formulas of) a formal system based on 
the predicate calculus, but without the effectiveness requirement for the 
axioms. 

Theorem 45. (Robinson’s consistency theorem, 1956.) Let S 2 and 
S„ be formal systems {or sets of formulas) based on the predicate calculus 
without or with functions and equality. Suppose that S, and S 2 are each 
simply consistent. Suppose they are in complete agreement , i.e. for each 
closed formula 1 in the common portion (or intersection) of their symbolisms, 
one of I and -il is procable in both S x and S a . Then the combined system 
(or union) S, U S 2 is simply consistent. 

Prooi-. Wc begin with the case of the predicate calculus without 
functions and equality but admitting individuals. Let E 0 , E t , E 2 , . . . and 
F 0 , Fj, F 2 , ... be the closures of the axioms of S! and S 2 , respectively. 
Suppose Si u S, is not consistent, so a contradiction K & -iK is deducible 

from H„. F 0 , E„ F„ E 2 , F., in the predicate calculus. Let E„, . . . , E,. 

F u F„ be those of E 0 , F 0 , E^ F lt E,, F,, . . . which are used in a given 
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deduction of K & — iK. Thus E„, . . . , E c , F 0> . . . , F tf b K & "iK in the 
predicate calculus. Thence by propositional calculus, 

l-E 0 &... &E e => -i(F 0 & ... & F a ). 

We take this as the ‘TED F” for Theorem 41. If no predicate parameter 
is common to E 0 & . . . & E c and — i(F 0 & ... & F„), then by Theorem 41 (b) 
h — i(E 0 & ... & E,) or b -i(F 0 & . . . & FJ. But then S, or S 2 , respectively, 
is inconsistent, contrary to hypothesis. If some predicate parameter is 
common, then by Theorem 41 (a) there is a formula l such that 

(i) b E 0 & . . . & E c 3 1 and (ii) b I 3 — i(F 0 & . . . & F d ) 

and 1 contains only parameters common to E„ & . . ■ & E„ and 

-i(F 0 & ... & FJ. So 1 is closed, since E 0 , E^ E 2 , . . . and F 0 , F 1; F 2 , . . . 

are closed. By the hypothesis of complete agreement, in the predicate 

calculus 

either (iii-1) E 0 , E^ E 2 , . . . b I and (iv-1) F 0 , Fj, F 2 , . . . b I 
or (iii-2) E 0 , E x , E 2 , . . . I — »1 and (iv-2) F 0 , F l( F 2 , . . . 1 il. 

But in the first case, (ii) and (iv-1) are incompatible with the consistency 
of S 2 ; for, contraposing (ii) (*13 in § 24), t-F 0 &,.,&F li D ~il. In the 
second case, (i) and (iii-2) are incompatible with the consistency of S x . 

For the predicate calculus with functions and equality, we use Theorem 
42 instead. 

Exercise 57.1. Show that, if E(Q 0 , Po, • • • , p 8 > r o> • • • » r t) defines Q 
explicitly from p 0 , . . . , p„ it does so implicitly. 
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5, 0 

>, 63 

1 


213 

P, Q, R, . 


5, 78 

&. v. 

-t 5, 9. 63-64 

1 

202, 

220-221 

P(-).Q(- 


. . 77-78 



24 

0, ’, + 


202, 207 

a, b, . . . , 

x, . 

78 

Li, 0 


49 

• 202, 

207, 

215; 262 

rr, 

t *£> • 

. . . 202 

-> 


286 

-l 


215 

a, b, ... , 

JC, . 

. . 248 

V 

74, 85, 141 ; 

105 

1 


205, 215 

r, r„ r 2 , . 


78 

V" 


105 

2, 3, . . . 


205 

c„ 


248 

3 

74, 86, 

141 

A, B, C, 


6, 7, 80 

Pr 


213 

3! 


154 

A( — ),B(- 


.80,94,96 

f(-), g<~ 


148 

= 

151, 

165 

.c/, 4, <6' 

1 • • > 

220 

/(— )> 


. . 221 

5*. < 


205 


... 220 

l 


169 




USED OTHERWISE 293 




| 

228; 235; 

298 

= 135, 

157- 

-165, 207 

AG, AGp 


219 


24, 

229 

< 185, 

207, 

254, 268 

f 


9, 85-86 

&, V 


229 



184 


1, . . 

149 


229; 

140 

/ — * 


159 

G, Gp 


215, 218 


r) 

229 


135, 

165, 183 

6/4, // 

306, 

289-290 

1 14, 

26, 88, 101, 

314 

a cr 
— > 

135, 

165, 183 

6'4a, G4b 

, o 

333, 338 

b-' 1 

89, 

314 

{ a , b, . . . 

1 

135, 183 

l(x, y), I,(x), . 

. . 85-92 

( <K) 

“ P 

318 

0 


135, 183 

max, min 


230 

p x i -- x tf 


106 

u, n 

140, 

185, 368 

W, 


243 

h 

34-36, 108, 

314 

.iC(x) 


137, 190 

N 

201- 

-206, 215 

h v ‘ *<- 


108 

M 


184 

Pd, Pdf 


220-221 


59 

', 68 

2-1/ 


184 

Pp 


220 

■ 't-A 

- 136-137, 

146 

0, 1,2, . . 


176, 207 

t 


9, 85-86 

— o, . 

. , 287, 289, 

306 



185 

T(i, a , x) 


243, 248 

->r, . 

. , , C'-> 

333 

0, O', d, d 

' 

268-269 

r, a, . . . 

44, 

, 286, 289 

\ , X 

287, 290, 

306 

° ^ 

49, 

331, 332 

yX a ) 


243-244 

Notations relating 

to Turing machines 


233- 

238, 

242-244 


- * 1 Exceptions occur to this classification-as-to-use. Thus on p. 196 “V” and "3" 
were used as though they were in the second list (before we had Footnote 165); for 
G4 etc. as the object language, “l”\ “A”, . . . belong in the first list; and other exceptions 
are noted in Footnotes 64 and 149. 
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Aandcraa, S. 346. 

abbreviations 22, 154, 202, 204-205, 215, 

218. 

Abelian groups 219, 278, 281. 
abstract: set 183; system 325; — ion 158, 

161, 163. 

Ackerntann, VV. 214, 256, 257, 259; 
Hilbert and — 24, 84, 107, 117, 125, 
283, 285, 322, 365. 
act (Turing machine) 234. 
activation 297. 

active state (Turing machine) 233. 
actual infinity 186, 195-196. 

Adain of Balsham 176. 

Addison, J. W. 200, 278. 
addition (plus) 207, 209. 
aflirniation of the consequent 15. 
algebra 192-193; — ic equation 177, 223, 
229; — ic number 177. 
algorithm 223 -231, 233, 240-241. 
al-Khu\varizmi 230. 
ambiguous: axioms 193; value 76. 
Ambrose, A. 142. 

analysis 194, 201, 258, 265, 330; of a 
proof or deduction 34, 40, 108, 296; 

truth-value — 30, 87. 
analytic: geometry 194; number theory 
197-198, cf. analysis, 
ancestor 296, 337; property 334. 
ancestral image 296-297, 337. 

Andrews, 1*. 346. 

antecedent 286; rules 289, 333; sub- 
formula 337; denial of the - 16. 
applied: logic, predicate calculus etc. 

221-222; to argument 235. 
argument: (logical, verbal) 58-63, 67 72, 
134; of a function 79, 148, 227, 235. 
Aristotelian: categorical forms 138-141; 

syllogisms 125, 145, 147. 

arithmetic 327. 

arithmetical predicate 273. 

assignment 9, 86, 284. 

associative laws 16, 22, 212, 215, 218. 

assumption formula 35, 54. 

atom 5, 62, 79, 149, 201, 220. 

attached variables cf. name. 


Aussonderungsaxiom 189-190, 280. 
autonyinous use of a symbol 6, 202. 
axiom 3, 33, 107, 154, 206, 215 etc., 387; 
schema 33-35, 107, 150, 206, 290, 306, 
387; cf. axiomatic, nonlogical. 
axiomatic: (-deductive) method, theory 3, 
33, 34, 1 9 1 -20 1 , 306 ; set theory 189-191, 
194, 280, 323-324. 

Bachmann, H. 183. 

Ball, W. W. Rouse 198. 

Barbara 145. 

Bar-Hillel, Y. 189, 265. 

Baroco 125, 145. 

Bauer-Mengelbcrg, S. 374, 379. 
belonging: (C) 135, 166, 183, 191; (logical 
analysis) 296, 338. 

Beltrami, E. 192, 361. 

Benacerraf, P. 371. 

Bcrnays, P. 107, 189, 257, 280; Hilbert 
and — 7, 81, 157, 170, 194-195, 203, 
214, 252, 256, 263, 285, 294, 307, 321, 
323, 345, 346, 349. 

Bernstein, A. R. 330. 

Berry’s paradox 188, 189, 190. 

Beth, E. W. 284, 285, 287, 306, 332, 336, 
365, 367; — ’s theorem 365-367. 
biconditional 70. 

Bochenski, I. M. 16. 

Bolyai, J. 8, 192. 

Bolzano, B. 176. 

Boole, G. 32, 48. 199. 

Boone, W. \V. 264, 265. 

Borel, Is. 278. 

bound: variable (occurrence) 80-81; cf. 

congruent. 

Britton, J. L. 264. 

Brouwer, L. E. J. 195-198, 302. 

Biichi, J. R. 264. 

Buraii-Forti paradox 186, 189. 
cancellation 218. 

Cantor, G. 162, 175, 182-189, 254, 256; 
— 's diagonal method 180-183, 188; 
— 's paradox 187, 189, 191; — ’s 
theorem 185, 187, 191, 323. 
cardinal number 175, 184, 185, 254. 
Carnap, R. 6, 35, 200. 
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Carroll, I .mis 70. 

cases, proof by 28. 44. 

categorical: axioms 193, 326; forms 

138-141. 

Cayley, A. 192, 361. 

chain: inference 16, 60; of equalities 156, 
212, 217; of equivalences 20, 124, 125, 
156; of implications 24; of inferences 
62,70 71. 

choice, axiom of 171, 190, .321, 346. 
Church, A. 15, 24, 35, 48, 84, 107, 226, 
232, 241, 246, 254, 260, 264, 283; — ’s 
theorem 246-247, 272; — ’s thesis 232, 
238-241, 245 248, 250, 252, 258, 264, 
for partial functions 244, relativised 
267; converse of — ’s thesis 241. 
Claplnim, C. R. .1. 265. 

Clark, R. 62, 64. 
class 135, 191, 323, cf. set. 
classical: logic etc. 8, 33, 49, 62-63, 83, 
85, 117, 1 94-195, 197-198; cf. intui- 

tionistic. 

closed: formula 105; path 287, 299, 308; 

tree 288; vs. open axioms 207-208, 319, 
closure 105. 

code (Turing machine) 242. 

Cohen, l\ .1. 190. 

commutative: group 219, cf. Abelian; 

laws 16, 219. 

compactness 3 1 2, 322, 324. 
complement 140. 

complete: agreement 368; axioms 324-326; 
predicate 266; system 43, 214; Post — 
system 50; simply — system 251. 
completed infinity 186, 195-196. 
completeness (results); explicit definability 
365; predicate calculus, cf. Godcl’s — 
theorem; propositional calculus 48, 
298 ; cf. complete, in — ness, 
composite formula 5, 79. 
computable function 228, 230-232, 235- 
241, 244-248, 257. 
computation: problem 226, 230; procedure 
22<T23I, cf. algorithm; truth-table — 
12, 29, 88; Turing machine — 237. 
computing machines 7, 29, 201, 211, 225, 
233, 265; cf. Turing machine, 
conclusion 34. 

condition (on a variable) 104. 
conditional 70; equation, interpretation 


103-105, 208, 319; contrary-to-fact 

— 10 . 

configuration (Turing machine) 234. 
congruent formulas 82, 100, 124, 154, 315. 
conjunction 5-7, 9, 15-16, 20-24, 27, 28, 
30-32, 39, 43, 44, 63-66, 121, 122, 128, 
143-144, 289, 314; introd. and dim. 44. 
conjunctive normal form 32. 
consequence cf. inference, valid — . 
consistency proofs: for axiomatic theories 
(number theory etc.) 194-197, 214, 
254-259, 261, 273-275, 281, 320, 331, 
342, 346, 368; for predicate calculus 
116 117, 154, 201, 306, 318, 333; for 
propositional calculus 43; cf. con- 
sistent. 

consistent: formula 28-29; list of formulas 
50; system 43; simply — system 43, 
254; to— system 274; cf. consistency, 
constant 104, 136, 357; variables held — 
104, 106, 108, 136, 205, 314; cf. in- 
dividual symbol, 
constructivity 195, 241, cf. Unitary, fini- 
list, intuitionislic. 
contingent formula 28, 69, 
contraction 333. 

contradiction: denial of — 16; cf. para- 
doxes. 

contradictory formula 28. 
contraposition 15-16, 59-60, 121. 
contrapositive 13, cf. contraposition, 
converse 13, 15; of Church’s thesis 241. 
correspondence 77; 1-1 — 175, 324. 
countable set 176, 324. 
countably infinite set 1 76. 
counterexample 284. 

Craig, W. 284, 349, 357, 359, 365; inter- 
polation theorem, — ’s lemma 357-359. 
Curry, H. B. 335, 342. 
cut 331-332, 334, 346. 

D-: falsifiablc, satisfiable 284; valid 89, 

314. 

Davis, M. 211,233, 269, 374. 
decidable: predicate 228, 243, 247, 248; 

formally — formula 251. 
decimal fractions 162, 181. 
decision: problem 223-225, 229, 230, 
264-267, 278; procedure 223-229, 247, 
for propositional calculus 224, 335. 
Dedekind, R. 177. 
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dcduciblc formula 35-36, 43, 108, 116, 

205, 314. 

deduction 33, 35-36, 42, 51, 108, 205; 
theorem 39, 42, 44, 112, 126; natural 
— 126, 136; cf. subsidiary, 
deductive: method 3, 33, 34; rules 35, 

199, 205. 

definability 362-365; 2 — 232, 241, 246, 
264; cf. descriptions, recursion, 
definiendum 362. 
definiens 363. 

definite: article, description 167-171, 214, 
cf. paraphrasing, 
definition 70, 84; descriptive 167; explicit, 
implicit 362-365; recursive 209, 238, 

261. 

degree (of unsolvability) 267-272, 295 ; 

higher — , lower — 267. 
Dehn, M. 264, 381. 

DeMorgan, A. 16, 48, 147; — ’s laws 16. 
denial of the antecedent 1 6. 
denial of contradiction 16. 

Denton, J. 257, 346, 349. 

denumerable set 176. 

dependent variable 74. 

derived rules 51, 117, 125-127, 211; cf. 

introduction. 

Descartes, R. 194. 

descendant 296-297, 337; image 296-297. 
descriptions (the, a) 167-172, 214. 
destructive dilemma 61. 
detachment, rule of 34. 
digits, method of 178-179, 243, 252. 
direct: rules 51, 109, 155; vs. in — 
methods or proofs 57, 196, 335-336. 
discharge of an assumption 54. 
disjoint: cases 31; sets 159, 190, 275. 
disjunction 5-7, 9, 11, 15-16, 21-24, 28, 
30-32, 44, 61, 64-66, 121, 122, 128, 
143-144, 289, 314; introd. and elim. 44; 

Herbrand — 345, 349. 
disjunctive normal form 32. 
disjunctive syllogism 61. 
distributive laws 16. 
divisibility 213, 228. 

Dodgsou, C. L. 70 

domain 84, 134-139, 162-164, 191, 294; 

cf. D-, Lowcnheim-Skolem theorem, 
double negation: elimination 44, 50; law 
of — 15-16, 49. 


Dreben, B. 257, 322, 346, 349, 376. 
duality 22-25, 101, 132, 353. 

Duhcm, P. 176. 

E-, EF-, F-: axiom, part, proof 349-350; 

inference, premise 352-353. 
effective: process, — ly calculable function 
231, 232, 241, 252. 
element (e) 135, 165, 183, 191. 
elementary number theory 201, 214. 
elimination: laws etc. 16, 21, 22, 30-32, 
122, 127; of cut 331; of descriptions 
(functions) 170, 214, cf. paraphrasing; 

cf. introduction, 
empty: domain, set 84, 135, 139, 183. 
endsequent 306. 
enthymeme 67-68, 147. 
enumerable set 176; recursively — 275. 
enumeration 176, 275, 281; theorem 269. 
Epimenidcs 188. 

equality 135, 151, 157-158, 184, 207, 
3 1 5-3 1 6, 357 ; axioms 1 54, 1 57, 209-21 2, 
215-218, 320; Leibniz’ definition of — 
163-164; theory of — 279. 
equivalence (between objects); classes 
159-165, 317; of sets 184; equality vs. 

— 157-165. 

equivalence (between propositions): 5-7, 
9-10, 16, 18-25, 30-32, 45, 63, 70, 
121-123, 155-156, 289„337; introd. and 
dim. 45; formal — 138; logical — , — 
in a system 20, 31, 70; material — 9, 10, 
70, 138, 158. 

erasure (Turing machine) 234, 236. 
essential undecidability 277-281. 

Eubulides 188. 

Euclid: (axiomatic-deductive method, — ’s 
dements) 3, 33, 158, 176, 191, 198; — ’s 
algorithm 224, 229, 230; — ean vs. 
non — ean geometry, — ’s parallel (fifth) 
postulate 8, 191-192, 194, 328, 361; 

— ’s theorem on primes 213. 

example 284. 

excluded middle, law of 16, 48, 49, 196, 

258. 

existence: introd. and dim. 118-119, 126, 
171; classical vs. intuitionistic — 195— 
196. 258; unique — 154, 167; cf. 

quantifiers. 

existential quantifier, cf. existence, 

quantifiers. 
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explicit definition 362- 363, 365. 
exportation 15, 16. 

extension 137; al concept 137, 164, 229; 
of a function 247; of a system 253, 276; 

axiom of — ality 1 89. 
extra formula 295-296. 

F- of. II-. 

falsiliability 284, 290, 297; D — 284. 
falsifying assignment 284. 
falsity 8, 62-63, 85-86, 357, 366. 
Fefernian, S. 254, 255. 

Fermat’s “last theorem” 225, 227, 283. 
Festino 125, 145. 

Feys, R. 49. 

finitary methods 195, 200. 

finite: axiomaii/alion 279; group 220; 

set 176-177. 

finitely axiomatizable theory 278-280. 
finitist mathematics 259. 
first-order: logic, predicate calculus 85, 
322, 327; cf. order, 
formal: axiomatics, axiomatic theory 192- 
195. 198-199; deduction, proof, etc. 
36 37, 42, 199, 205-206, 211; equiva- 
lence. implication 138; expression 202, 
203; number theory 201-206 (results 
§§ 38, 43, 44, 47, 53); symbol 202, 
215. etc.; system 198-201, 252-254, 

306. 

formalism cf. formal system, 
formally decidable formula 251. 
formation rules 35, 199, 205. 
formula 4 5, 79, 149, 168, 201, 203, 215 
etc., 220-221, 224. 

Fraenhel, A. A. 183, 189 -190, 280. 
free: substitution 94, 98 -100; term 

(occurrence) 150; variable (occurrence) 
80-81; — for 94, 150, 206. 
Frege, G. 36, 48, 184, 199, 267, 283. 
Fricdberg, R. M. 272. 
function 74-75, 148-149; letter 221; 

parameter 284; symbol 169, 203, 221; 
variable 220; logical - - 85, 89, 229; 

propositional — 74. 
functional 75; calculus 75. 

Galileo’s “paradox” 175-177. 

Gauss, C. F. 1 86. 

generality: interpretation 103-105, 109, 
208, 319; cf. quantifiers, 
general properties of l 52. 116. 


general recursive function cf. recursive, 
genetic approach 195. 

Gentzen, G. 35, 44, 126, 256, 257, 306, 

331, 3.32, 335, -336, 342, 345; — 's 
Hauptsat/ (normal form theorem) 331- 

332, 336; — ’s normal form for proofs 
335, 339; — ’s sharpened Hauptsatz 
342; — type systems 127, 306, 331-336, 

338, 342; cf. consistency. 
Gilmore, P. C. 357. 

Gbdel, K. 117-118, 190, 191, 213, 214, 
232, 250, 252-253, 257, 273, 280, 
284-285, 305, 307, 312, 314, 316, 318, 
321-322, 327, 365; — ’s completeness 
theorem 117-118, 284, 285, 305, 311, 
314, 315, 318-324, 327-328, 332, 346; 
— ’s (incompleteness) theorem, includ- 
ing generalized versions 214, 250-254, 
257, 271-275, 327-328; — numbers 
252; — ’s second theorem 214, 255-256, 
274; symmetric form of — ’s theorem 

275. 

group 216; theory 193, 215-220, 264, 278, 

281. 

Haken, W. 264. 

Hall, M. 216. 

Hcnkin, L. 200, 285, 307, 327, 329, 330, 

346, 357. 

I lerbrund, J. 39, 126, 232, 256, 257, 322, 
331, 336, 343, 345-346; disjunction 345, 
349; functions 343; -’s theorem 331, 

336, 345-349. 

Hcrmite, C. 232, 

Hertz, P. 126, 306. 

Heyting, A. 162, 196, 259. 
hierarchies 269-272, 278, 295, 328. 
higher cf. degree, order. 

Higntan, G. 264. 

Hilbert, D. 194-201, 214, 225, 229, 253, 
256-257, 327, 330; and Ackermann, cf. 
Ackermann; and Bernays, ef. Bernays; 
arithmetic 327; — type systems 126, 
306, 332, 334-336; cf. consistency. 
Hintikka, K. J. J. 285. 

Holkot, R. 176. 
hypothetical syllogism 60. 
ideal statements 197, 257. 
idenipotent laws 16. ’ 
identical equation 103. 
identically: false 28; true 12, 104. 
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identity cf. equality; element in a group 
2 1 5-2 1 9, of indiscernibles 1 63 ; principle 

of — 16. 

image 296-297, 337. 

immediate: ancestor, descendant 296; 

subformula 334. 
implication 5-7, 9-10, 15-16, 18, 22-25, 
30, 32, 38, 39, 42, 44, 63, 69-70, 112, 
114, 121, 122, 124, 128, 138, 289; 
inlrod. and elim. 44; formal — 138; 
logical — , — in a system 70; material 
— 9,10, 70,138. 
implicit definition 364-365. 
importation 15-16. 
improper cf. proper. 
inclusion(£) 135, 165, 183. 
incompleteness (results) 250, 323, 327; 
cf. Godel’s — theorem, Lowenheim- 
Skolem theorem, nonstandard, 
inconsistent 69, cf. consistent, 
indefinite: article, description 143, 171 — 

172. 

independent: postulates 193, 361 ; primitive 
concepts 362; variable 74. 
index (Turing machine) 243. 
indirect cf. direct. 

individual 74, 148; parameter 284; symbol 
203, 284, 319; variable 85. 
Induction 209, 212, 256, 329. 
inference 34; cf. rules of — . 
infinite set 176-177, cf. infinity, 
infinity: actual (completed) vs. potential 
(uncompleted) 186, 195-196, 202, 233; 

axiom of — 190. 
informal: logical symbolism 24, 229; cf. 

formal. 

integer 159, 176, 216, 280. 
intended cf. standard, 
intension 137; — al concept 137, 164, 229. 
interchange: lemma 341; of inferences 
339-341; of premises 16. 
interpolated formula 350. 
interpolation theorem 349, 351, 357-359. 
interpretation (of languages generally) 
192-195, 200, 207, 249, 257, cf. model, 

standard. 

interpretation (of variables): conditional 
vs. generality — 103-105, 109, 208, 319; 
name form — , predicate — 77. 

intersection 1 40, 368. 


introduction: and elimination of logical 
symbols (implications) 15-16, 35, 107 
(derived rules) 44-45, 50-57, 118-119, 

125- 127; of symbols into implications 
and equivalences 121 ; Gentzen — rules 

306, 289. 

intuitionistic vs. classical: (logic etc.) 49, 
50, 117, 132-133, 162, 195-198, 222, 227, 
257-259, 270, 272, 274, 332, 342, 351. 
inverse in a group 215-219. 
ion 77-79, 201, 203, 220-221. 
irreflexivity 145, 185, 254, 268. 
isomorphism 325. 

Jaskowski, S. 84, 126, 336. 

Kalmar, L. 45, 240, 316. 

Kanger, S. 285, 306. 

Keisler, H. J. 67 
Kemeny, J. G. 329. 

Ketonen, O. 306. 

Kleenc, S. C. 15, 30, 36, 81, 196, 213, 232, 
234, 238, 240, 242, 243, 248, 250, 252, 
254, 257, 258, 259, 264, 267, 269-270, 
272, 273, 275, 278, 285, 302, 321, 322, 
327, 328, 332, 337-338, 341-342, 346, 

357-358. 

Klein, F. 192, 361. 

Kolmogorov, A. N. 257. 

Konig’s lemma 302, 304. 

Kreiscl, G. 254, 257, 259, 349. 

Lacombe, D. 272, 275. 
lambda-definability 232, 241, 246, 264. 
Langford, C. H. 9, 49. 
language: computer — s 7, 201, 265; 
meta — , syntax — , 199-200, cf. meta-; 
object — , observer’s — 3, 200, cf. 
object — ; verbal, natural, ordinary 
— 5-6, 58-59, 199-200, cf. translating; 

cf. symbolism. 

Lazerowitz, M. 142. 

Leibniz, G. W. v. 163-164. 

Lewis, C. I. 9, 49, 163. 

Liar, The 188, 189, 190, 253. 
linearly ordered set 254. 

Liouville, J. 182. 

Lobatchevsky, N. I. 8, 192, 194. 
logic (aims and methods) 3-5, 8, 33, 
36-37, 50, 59, 61, 70-72, 117, 118, 

126- 127, 199, 211, 283, 305-307, 322, 
335-336, 338; cf. predicate calculus, 

propositional calculus. 
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logical: axioms .1.1. 199, cf. non — ; cal- 
culus 278; equivalence, implication 20, 
70; function 85, 89, 229; inferences, 
rules 331; symbol 79; symbolism 5-6, 
23,79,199-200,229. 
logistic: method, system 199. 

I .ijwenlu'ini^J 225 t 294, 3J.L 321, 343; 

" -Sholem theorem 311-312, 316, 318, 

321-323. 

lower: ilegiec 267; predicate calculus 85. 
Lukasiewicz, J. 16, 48, 49, 336. 

Lusin, N. N. 278. 

Lyndon, R. C. 349, 357 -358. 

MacColl, II. 49. 

MucDuffee, C. C’. 224. 

Mal'cev, A. I. 307,330. 

Markov, A. A. 233, 264. 
material: axiomatics 191; equivalence, 
implication 9-10, 70, 138, 158. 
mathematical: axiom 207; induction 212; 

logic 3. 

member (6) 135, 165, 183, 191. 

Mcndelson, E. 240. 
meson 148, 201. 203, 220-221. 
meta-: language 3, 199, 200, 205-206; 
mathematics 195, 199-201, 205-206, 
283, 331 ; theory 199. 

midsequent 342. 

Mine, G. E. 378. 

modal propositional calculi 49. 

model 3_3- 61, 194, 319-320; theory 33, 48, 

50, 61 , **l I^TlS, 200, 283, 318-322, 

.3.32; sub - 322; cf. standard, 
modus ponens 34,44, 61. 107, 126, 334-336, 

346. 

modus tollcns 61. 
molecule 5, 79. 

moment (Turing machine) 233. 

Mostowski, A. 84. 208, 269, 272, 278, 280, 
281, 285, 295. 

Mucnik, A. A. 272. 

multiplication (times) 9, 207, 209, 216- 

217. 

multiplicative axiom 190. 

Nagel, E. 253. 
naive set theory 1 89. 
name form 77, 78-79, 97, 148. 
natural: deduction 126, 336; numbers 176, 
185, 209, 329-330; cf, language, 
negation 5-7, 9, 15-16, 21-24, 44, 49, 50, 


57, 64, 127, 131, 133, 196, 289; ini rod. 

and elim. (double, weak) 44. 
negative (Negative) part of a formula 
(sequent) 124, 337 (337). 

Nelson, D. 259. 

Newman, J. R. 253. 
non-Euclidean geometry cf. Euclid, 
nonlogica) axioms 206-208, 278, 319. 
nonstandard: analysis 330; arithmetic 
326-330; model 327; set theory 324. 
nonterminating decimal 181. 
normal form: conjunctive — , disjunctive 
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